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ABSTRACT 
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full-scale data collection effort of the Beginning Postsecondary Students 
Second Follow-up Study 1996-2001 (BPS : 1 996/2001 ) . These students, who started 
their postsecondary education in the 1995-1996 academic year, were first 
interviewed during 1996, and subsequently interviewed in 1998. At the 
beginning of BPS: 96/98, more than 12,400 students had been identified as 
potentially eligible for the National Postsecondary Student Aid Study and the 
BPS. About 10,350 of these students participated in BPS: 96/98, and most of 
the BPS : 1996/2001 were these respondents, with a few additions. The 
BPS : 1996/2001 study is the second follow-up of this cohort. BPS : 1996/2001 
included important changes from the BPS: 90/94 follow-up surveys. The data 
collection instrument was considerably refined to reduce -respondent burden 
while still collecting key information on postsecondary enrollment, 
employment, and demographics. In addition, BPS : 1996/2001 was conducted during 
the sixth academic year ,• thus collecting attainment information for students 
who completed their degree in either their fifth or sixth years. Evaluation 
of the procedures used in the full-scale data collection were developed and 
refined as part of the field test conducted in 2000. The methodology report 
contains these chapters: (1) "Overview of BPS : 1 996/2001 " ; (2) "Design and 

Method"; (3)."Data Collection Outcomes"; (4) "Evaluation of Data' Quality"; 

(5) "Data File Development"; and (6) "Weighting and Variance Estimation." 
Seven appendixes contain details about aspects of methodology. (Contains 47 
tables and 28 figures.) ( SLD) 
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Executive Summary 



Introduction 

The 1996 Beginning Postsecondary Students Longitudinal Study (BPS:96), 
sponsored by the National Center for Education Statistics (NCES) in the U.S. Department 
of Education, follows a cohort of students who started their postsecondary education 
during the 1995-96 academic year. These students were first interviewed during 1996 as 
part of the 1995-96 National Postsecondary Student Aid Study (NPSAS:96). In 1998, 2 
academic years after the cohort’s entry into postsecondary education, the first follow-up 
interview (BPS:96/98) was conducted. BPS: 1996/2001 is the second and final follow-up 
interview with the BPS: 96 cohort. This interview, which took place in 2001, focused on 
persistence and attainment among students enrolled in 4-year institutions and 
employment among students no longer enrolled. This report describes the procedures 
and results of the full-scale implementation of BPS: 1996/2001. 

Sample Design 

The respondent universe for the BPS:96/98 and BPS: 1996/2001 interviews 
consisted of all students who began their postsecondary education for the first time 
during the 1995-96 academic year at any postsecondary institution in the United States or 
Puerto Rico. The students sampled were first-time beginning postsecondary students 
who attended postsecondary institutions eligible for inclusion in NPSAS:96 and who 
were themselves eligible for NPSAS:96. 

All BPS:1996/2001 sample members had completed either the NPSAS:96 
interview, the BPS.96/98 interview, or both interviews. At the beginning of BPS:96/98, 
over 12,400 students had been identified as potentially both eligible for NPSAS:96 and 
first-time beginners (i.e., eligible for the BPS interviews). Of those students, about 10,350 
were located and completed a BPS:96/98 interview, with almost 10,300 of them 
determined to be both NPSAS and BPS eligible. The majority of the BPS:1996/2001 
sample consisted of these BPS:96/98 respondents. However, the BPS:96/98 respondents 
were supplemented by a subsample of about 100 BPS:96/98 nonrespondents. The 
BPS. l 996/2001 sample was representative of the students who first began postsecondary 
education in 1995-96. 

Instrumentation 

All sample members were eligible for participation in BPS: 1996/2001, having had 
their eligibility determined as part of either the NPSAS:96 or the BPS:96/98 interview. 
Consequently, the BPS: 1996/2001 interview focused exclusively on activities since the 



last interview. The first section of the instrument collected information on postsecondary 
enrollment and degree attainment. A second section collected information on 
undergraduate education experiences. A third section, on postbaccalaureate education 
experiences, was included for those sample members who had completed a bachelor’s 
degree since the last interview. A fourth section collected extensive employment 
information for the current job if no degree had been earned since the last interview. For 
those who had earned a degree, employment information was collected for the current job 
and for the first job held after degree completion, if different. The final section updated 
the sample members’ family, financial, and disability status and their civic participation 
since the last interview. 

Data Collection Design and Outcomes 

Interviews were conducted using computer-assisted telephone interviewing 
(CATI). Cases for sample members for whom no locating information was available 
were sent directly to a specialized tracing unit for intensive tracing. The tracing unit was 
also used for intensive tracing once all contact information for sample members was 
exhausted during attempts to conduct the telephone interview. 

In addition to telephone interviewing and intensive tracing, field locating and 
interviewing were available for certain cases that fell into any one of 30 geographic 
clusters developed according to the zip code of the last known address for the sample 
member. Potential field cases were those in which CATI and intensive tracing failed to 
locate sample members or in which sample members initially refused to participate in the 
interview. Computer-assisted personal interviewing (CAPI) software was available on 
laptop computers for field interviewing. 

Training 

Training programs on successful locating and interviewing were developed for 
telephone and field staff. Topics covered administrative procedures required for case 
management; quality control; locating; interactions with sample members, parents, and 
other contacts; the nature of the data to be collected; and the organization and operation 
of the CATI and CAPI programs used for data collection. Tracing specialists received an 
abbreviated training specific to the needs of BPS: 1996/2001. 

Interviewing 

CATI locating and interviewing began at the end of February 2001. Contact 
information for the BPS:96/98 respondents was loaded into CATI initially, followed by 
contact information for the BPS:96/98 nonrespondents several weeks after the start of 
CATI. Field interviewing began about 12 weeks following the start of telephone 
interviewing. 

Of the original starting sample, 21 sample members were found to be deceased 
since the last interview. The unweighted contact rate among the remaining 
BPS: 1996/2001 sample members was 92 percent. Of those contacted, 96 percent were 
interviewed for an overall unweighted response rate of 88 percent. 
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Refusal Conversion 

Important to successful interviewing was the ability of the interviewers to gain the 
cooperation of sample members, thereby avoiding a refusal. The telephone interviewers 
included refusal conversion specialists with special training in attempting to convert 
(interview) sample members who have refused to complete the interview. From the point 
when a sample member refused, the case was handled only by these conversion 
specialists. In BPS: 1996/2001, 1,860 sample members refused at least once to participate 
in the interview. Of those, 74 percent were converted and interviewed. 

Field Interviewing 

Field interviewers were assigned a total of 1,213 cases, covering 30 geographic 
clusters. Cases were identified for the field for a number of reasons, including inability 
to locate in CATI, Puerto Rico residence, refusal in CATI, and exhaustion of locating 
leads. Only cases located in reasonable geographic proximity to a field interviewer were 
assigned to the field. Of the 1,213 cases fielded, 80 percent were contacted, and 90 
percent of those were interviewed, for an unweighted response rate of 72 percent. 

Nonresponse Incentive 

Incentives were offered as necessary to targeted sample members in order to 
encourage participation among sample members who would otherwise not have 
participated in the interview. Those offered incentives included the BPS:96/98 
nonrespondents, a subset of refusal cases, and those who were hard to reach or could not 
be located. By the end of data collection, 4,106 sample members had been offered 
incentives and, of those, 72 percent were converted. 

Indeterminate Responses 

Efforts were made to encourage response to all items in the BPS: 1996/2001 
interview and to convert indeterminate responses (i.e., “don’t know” and “refusal” 
responses), especially for those items that historically have had high nonresponse (e.g., 
income). As a result, item nonresponse was quite low throughout the interview. Only 9 
of the 445 CATI items had indeterminate response rates in excess of 10 percent. 

Interview Timing 

The average administration time for the BPS: 1996/2001 interview was 17.8 
minutes, over 2 minutes shorter than the first follow-up interview (BPS:96/98). In the 
2001 interview, BPS:96/98 nonrespondents took an average of 3.6 minutes longer than 
BPS:96/98 respondents. This is because the 2001 interview updated enrollment and 
employment information since the last interview (in 1996 for BPS:96/98 nonrespondents 
and in 1998 for BPS:96/98 respondents). 
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Online Coding 



The BPS: 1996/2001 instrument included systems allowing the interviewer to 
perform computer-assisted online coding of literal responses for postsecondary 
institution, major, occupation, and industry. These online coding systems were designed 
to improve data quality by capitalizing on the availability of the respondent to clarify 
responses at the time the coding was performed. Only the postsecondary institution 
coding system - which included only U.S. institutions - resulted in more than 10 percent 
uncodeable responses, primarily because some sample members attended foreign 
institutions. 

Analysis Weights 

Cross-sectional weights were developed for analyzing the respondents to the 
BPS: 1996/2001 interview. In addition, two longitudinal weights were constructed, one 
for analyzing the students who participated in all three interviews — NPSAS:96, 
BPS:96/98, and BPS: 1996/2001 — and the other for analyzing the students who 
participated only in NPSAS:96 and BPS: 1996/2001. Variances were computed using the 
Taylor Series and balanced repeated replications (BRR) techniques. Weighted response 
rates and survey design effect tables are provided in chapter 6. 

Data Files 

Because BPS: 1996/2001 was the third of three interviews, the BPS: 1996/2001 
data set includes the derived variable and interview files for all three interviews. Also 
included are data collected from institution records, government databases, and admission 
test vendors throughout the period covered by the NPSAS:96 interview through the 
BPS: 1996/2001 interview. 

Products 

In addition to the methodology report, NCES plans to release the following major 
products for BPS: 1996/2001: a public-use Data Analysis System (DAS), restricted-use 
research files with an associated electronic codebook (ECB), and a descriptive summary 
of significant findings with an essay on the persistence and attainment of students at 4- 
year institutions. The DAS, containing derived variables and associated documentation, 
will enable users to specify and create numerous tables. Restricted-use files will be 
available to those researchers who need raw data not included in the DAS and who have 
applied for and received authorization from NCES. The descriptive summary, as the first 
NCES report based on this data set, will discuss major findings on persistence and 
attainment and present additional descriptive statistics in a table compendium. 
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Foreword 



This report describes the methods and procedures used for the full-scale data 
collection effort of the Beginning Postsecondary Students Second Follow-up Study 
1996-2001 (BPS: 1996/2001). These students, who started their postsecondary education 
during the 1995-96 academic year, were first interviewed during 1996 as part of the 
National Postsecondary Student Aid Study: 1996 (NPSAS:96). They were subsequently 
interviewed in the spring of 1998. The BPS: 1996/2001 study is the second follow-up of 
this cohort. 

BPS: 1996/2001, included important changes from the BPS:90/94 follow-up 
surveys, conducted in 1992 and 1994 following the cohort of beginning postsecondary 
students selected as part of NPSAS:90. The data collection instrument was considerably 
refined to reduce respondent burden while still collecting key information on 
postsecondary enrollment, employment, and demographics. In addition, BPS: 1996/2001 
was conducted during the sixth academic year (compared to the fifth academic year for 
BPS:90/94), thus collecting attainment information for students who completed their 
degree in either their fifth or sixth year. 

Evaluation of the procedures used in the full-scale data collection were developed 
and refined as part of the field test conducted in 2000. We hope that the information 
provided here will be useful to a wide range of interested readers and that the results 
reported in the forthcoming full-scale descriptive summary report will encourage others 
to use the BPS data. We welcome recommendations for improving the format, content, 
and approach, so that future methodology reports will be more informative and useful. 



C. Dennis Carroll 
Associate Commissioner 
Postsecondary Studies Division 
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Chapter 1 
Overview of BPS:1996/2001 



This document describes the procedures and results of the full-scale implementation of 
the Beginning Postsecondary Students Longitudinal Study: 1996-2001 (BPS: 1996/2001). 

BPS: 1996/2001 is the second follow-up data collection conducted with students who started their 
postsecondary education during the 1995—96 academic year. Research Triangle Institute (RTI), 
with the assistance of MPR Associates, Inc. (MPR), conducted the study for the National Center 
for Education Statistics (NCES) of the U.S. Department of Education (Contract No. ED-99-CO- 
0112), as authorized under Section 404(a) of the National Statistics Act of 1994 [PL103-382], 

This introductory chapter describes the background, purposes, schedule, and products of 
BPS: 1996/2001. The second chapter describes the design and methods used during the full-scale 
study. A discussion of data collection outcomes is presented in chapter 3. A description of the 
procedures implemented to ensure the quality of the BPS data and an evaluation of the quality of 
the data collected are provided in chapter 4. Data file construction is discussed in chapter 5. 
Results of sample weighting and variance estimation are presented in chapter 6. Additional 
materials used during the study are provided as appendices to the report and cited, where 
appropriate, in the text. 

A. Background and Objectives of BPS 

Each academic year, several million students begin postsecondary education for the first 
time. The Beginning Postsecondary Students Longitudinal Study (BPS) series provides an 
opportunity to describe these students during their first year and at multiple time points after 
their first year. As one of several studies sponsored by NCES to respond to the need for a 
national, comprehensive database on postsecondary education, the BPS series addresses issues 
related to enrollment, persistence, progress, attainment, continuation into graduate/professional 
school, employment, and rates of return to society. 

Since nearly half of all beginning students enroll at more than one institution during the 5 
years after they begin postsecondary education, 1 being able to monitor the progress of these 
students across postsecondary institutions has become increasingly important. Through its 
unique design, the BPS study series makes it possible to trace the paths of first-time beginning 
students (FTBs) throughout the entire system of postsecondary education over a number of years. 
Consequently, whereas typical retention and attainment studies of entering freshmen provide 



1 Berkner, L.K., Cuccaro-Alamin, S., and McCormick, A.C. (1996). Descriptive Summary of 1989-90 
Beginning Postsecondary Students: 5 Years Later, with an Essay on Postsecondary Persistence and Attainment 
(NCES-96-1 55, ED396597). U.S. Department of Education. Washington, DC: National Center for Education 
Statistics. 
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data at a single institution, BPS allows for the study of student persistence and attainment 
anywhere. The BPS series is also unlike previous longitudinal studies of high school age cohorts 
in that its student sample includes nontraditional postsecondary students who delayed 
continuation of their education after high school for a variety of reasons. 

The first BPS series, BPS:90, involved data collection at three points in time (see 
figure 1.1). Base year data collection during the first year of postsecondary study occurred 
during the 1989-1990 academic year for the 1990 cohort, as part of the 1990 National 
Postsecondary Student Aid Study (NPSAS:90). Two subsequent data collections took place in 
the third (BPS:90/92) and fifth academic years (BPS:90/94) following initial enrollment. 

The current series, BPS:96, collected data at three similar points in time (figure 1.1). 
Consistent with BPS: 90, base year data collection occurred as part of NPSAS:96, the first year of 
postsecondary study for the 1996 cohort, and the first follow-up (BPS:96/98) occurred 2 years 
later, during the third academic year following entry. However, unlike BPS:90, the second 
follow-up of the 1996 cohort (BPS: 1996/2001) was conducted 6 academic years following entry, 
rather than 5. This timing allowed for the collection of attainment information for students who 
completed their degree in either their fifth or sixth year. 

Only students who had never completed a postsecondary course prior to the 1995-96 
academic year were eligible for participation in BPS: 96. Questions for FTB-determination, 
along with items addressing how students and their families pay for postsecondary education, 
were administered as part of the base year studies (NPSAS:90; NPSAS:96). Items in the first 
follow-up studies (BPS:90/92; BPS:96/98) focused on issues of persistence — academic progress 
through the first 3 years of postsecondary study — among students enrolled in 4-year institutions, 
and attainment among students enrolled in less-than-2-year and 2-year colleges. Nontraditional 
students were asked about the reasons they delayed enrollment, their prior employment 
experience, and their purpose for enrolling. Interviews addressed the differences between those 
with immediate vocational goals and those intending to earn a bachelor’s degree, including those 
beginning at community colleges. In addition, sets of items identified transfers, stopouts, and 
dropouts, and the reasons for these enrollment behaviors. 

Because the second follow-up of the BPS:90 cohort, BPS:90/94, occurred during the fifth 
academic year and the second follow-up of the BPS:96 cohort, BPS: 1996/2001, took place 
during the sixth academic year since first enrollment, some items in the BPS: 1996/2001 
interview collected retrospective information about the fifth academic year to allow cross-cohort 
comparisons. Persistence and attainment among students enrolled in 4-year institutions and 
employment among students no longer enrolled were the primary topics for the second follow- 
up. These studies serve to monitor academic progress over time, allowing assessment of 
completion rates for 4-year programs in the normal time expected. For students who graduated 
in the 4-year time period, the BPS: 1996/2001 survey occurred 2 years after receipt of the 
bachelor’s degree and addressed issues of attainment, graduate school access, and initial rate of 
return. 
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Figure 1.1. — Chronology of the Beginning Postsecondary Students 
Longitudinal Study 



BPS:1990 

Cohort 




BPS:1996 

Cohort 




NOTE: AY1 through AY6 indicate the first through sixth academic years covered by the longitudinal period for 
each study. BPS sample members first entered postsecondary education during AY1. 
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For those students who terminated their postsecondary education prior to completion of a 
baccalaureate degree, the BPS: 1996/2001 follow-up 6 years after college entry collected more 
detailed information on continuation and rate of return. It provided information on how many 
FTBs returned for additional education either in the same or a different field within the limited 
time period. For those who did not continue, it provided some rate of return information for 
employment and other societal benefits related to education. 

By following a cohort of new entrants into postsecondary education (PSE), the BPS 
series of studies provides a unique perspective of what happens to persons as they enter and 
pursue education beyond high school. Because it includes both nontraditional and traditional 
students who entered PSE immediately after high school, BPS permits study of educational 
aspirations, progress, persistence, and attainment for both groups of students. By providing 
longitudinal data for a single cohort and trend data across cohorts, the BPS series contributes to 
our understanding of the value of a student’s postsecondary education both to the student and to 
society, and to the comprehensive national database addressing policy issues at the 
postsecondary level. 

B. Schedule and Products of BPS:1996/2001 

The operational schedule for BPS:1996/2001 is presented in table 1.1. 



Table 1.1. — Operational schedule for BPS:1996/2001 



Activity 


Start date 


End date 


Sampling 


12/2000 


4/2001 


RIMG/OMB forms clearance 


6/2000 


2/2001 


Instrument development 


8/2000 


2/2001 


Tracing 


7/2000 


12/2001 


CATI training 


2/2001 


4/2001 


CATI data collection 


2/2001 


9/2001 


CAPI training 


5/2001 


5/2001 


CAPI data collection 


5/2001 


9/2001 


Data files and documentation 


3/2001 


12/2001 


Methodological reporting 


3/2001 


6/2002 


Descriptive reporting 


3/2001 


2/2003 



The major products of BPS: 1996/2001 include 

• restricted-use research files with an associated Electronic Codebook (ECB) 

containing all data collected for BPS: 1996/2001, all BPS.96 data from the base year 
and first follow-up data collections, derived analytic variables, and sampling weights; 
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• a public-use Data Analysis System (DAS) containing analytic variables, associated 
documentation, and tools to produce a variety of user-specified tables; 

• methodology reports for the field test and full-scale studies, providing details of 
sample design, data collection procedures, data file construction, sample weighting, 
variance estimation, and the results of nonresponse bias analyses; 

• special tabulations of issues of interest to the higher education community; and 

• a descriptive summary of significant findings with an essay on persistence and 
attainment of students at 4-year institutions. 



Chapter 2 
Design and Method 



A. Sampling Design 

1. Respondent Universe 

The respondent universe for the BPS:1996/2001 full-scale study consisted of all students 
who began their postsecondary education for the first time during the 1995-96 academic year at 
any postsecondary institution in the United States or Puerto Rico. The sample students were the 
first-time beginning students (FTBs) who attended postsecondary institutions eligible for 
inclusion in NPSAS:96 and who were themselves NPSAS-eligible. 

a. Institution Universe 

Consistent with previous NPSAS studies, institutions eligible for NPSAS:96 and, 
consequently, eligible for the BPS:96 cohort, were those that satisfied all of the following 
conditions for the 1 995-96 academic year: 

• offered an educational program designed for persons who have completed 
secondary education; 

• offered more than just correspondence courses; 

• offered at least one academically, occupationally, or vocationally oriented 
program of study requiring at least 3 months or 300 contact hours of instruction; 

• offered courses that were open to the general public (i.e., not just to specific 
populations such as prison inmates or members of the organization offering the 
courses); and 

• were located in the United States or Puerto Rico. 

U.S. service academies were excluded from participation because of their atypical 
funding and tuition base. Also ineligible were institutions offering only avocational, recreational 
remedial, or correspondence courses; institutions not open to the public; hospitals offering only 
internships or residency programs; institutions offering only noncredit continuing education 
units (CEUs); schools whose only purpose was to prepare students to take a particular 
examination (e.g., CPA or Bar exams); institutions offering only programs of study which 
required less than 3 months or 300 contact hours of instruction; and branch campuses of U.S. 
institutions in foreign countries. 
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b. Student Universe 

Students eligible for the BPS:96 cohort were those students eligible for NPSAS:96 who 
were FTBs at NPSAS sample institutions during the 1995-96 academic year (except those who 
were deceased). NPSAS :96-eligible students were enrolled in NPSAS-eligible institutions 
during the 1995-96 academic year and satisfied all of the following eligibility requirements: 

• were enrolled in a term or course that began between May 1, 1995, and April 30, 

1 996; 1 

• were enrolled in either (a) an academic program; (b) at least one course for credit 
that could be applied toward fulfilling the requirements for an academic degree; 
or (c) an occupational or vocational program that required at least 3 months or 
300 clock hours of instruction to receive a degree, certificate, or other formal 
award; 

• were not concurrently enrolled in high school; and 

• were not enrolled solely in a GED or other high school completion program. 

The NPSAS-eligible students who had never enrolled in a postsecondary institution after 
completing high school were considered “pure” FTBs and were, of course, eligible for the 
BPS:96 cohort. However, those NPSAS-eligible students who had enrolled for at least one 
course after completing high school but had never completed a postsecondary course before the 
1995-96 academic year were considered “effective” FTBs and were also eligible for the BPS:96 
cohort. 



2. Statistical Methodology 

The NPSAS:96 sampling design was a two-stage design in which eligible institutions 
were selected at the first stage and eligible students were selected at the second stage within 
eligible, responding sample institutions. The NPSAS:96 sample, the process of identifying and 
selecting FTBs for the BPS follow-up studies, and the BPS: 1 996/2001 subsampling procedures 
are described below. 



a. NPSAS:96 Institution Sample 

The institution-level sampling frame for NPSAS:96 was constructed from the 1993-94 
Integrated Postsecondary Education Data System (IPEDS) Institutional Characteristics (IC) file 2 . 



1 This full year of enrollment is the operational survey population. The ideal target population consists of 
the terms in the 1995-96 financial aid award year, those beginning between July 1, 1995, and June 30, 1996. The 
survey year is slightly shifted from the ideal year to allow more timely data collection and dissemination of results. 

2 The 1993-94 IPEDS IC file was the latest version available at the time of NPSAS:96 institutional 
sampling. 



8 



Chapter 2: Design and Method 



The following sets of records that did not correspond to institutions eligible for 
NPSAS:96 were deleted: 

• administrative units (SECTOR=0); 

• U.S. Service academies (OBEREG = 00); 

• U.S. Territories, except Puerto Rico (OBEREG = 09 and STABBR not ‘PR’); 

• institutions that offer no programs of at least 300 contact hours, 6 semester or 
trimester hours, or 12 quarter hours and for which the highest level of offering 
was a certificate or diploma of less than 1 academic year (PG300 = 2 and 
HLOFFER < 1); 

• institutions offering only correspondence courses (UNITID=249928, 137379, 
367644, and 385363); 3 and 

• 12 institutions with reported real (not imputed) zero enrollment (based on 
unduplicated head counts) for the 1992-93 academic year. 4 

These edits resulted in a sampling frame consisting of 9,468 institutions that appeared to be 
eligible for NPSAS:96 based on their 1993-94 IPEDS IC data. 

Sample institutions were selected for NPSAS:96 with probabilities proportional to 
composite measures of size based on overall sampling rates by type of institution and type of 
student. The overall institution sample sizes and sampling rates are shown in table 2.1 for each 
of the nine institutional sampling strata. The expected frequency of selection exceeded unity 
(1.00) for some institutions because of their relatively large enrollment within their stratum. 
These institutions were included in the sample with certainty. The numbers of certainty and 
noncertainty institutions selected are shown for each stratum in table 2.2. 

Within each of the nine institutional strata, additional implicit stratification was 
accomplished by sorting the sampling frame for each stratum in a serpentine manner 5 by the 
following variables: 

• institutional level; 

• the Office of Business Economics (OBE) Region (from the IPEDS IC file) with 
Alaska and Hawaii moved to Region 9 with Puerto Rico; and 

• the institution measure of size. 



3 These were identified by calling the institutions. The calls resulted from searching for “corr” in the name 
of the institution and from checking discrepant/outlier enrollment data. 

4 Unduplicated head count data are collected for the academic year prior to the one in which the IPEDS data 
collection is conducted. 

5 Williams, R.L., and Chromy, J.R. (1980). “SAS Sample Selection MACROS.” Proceedings of the Fifth 
Annual SAS Users Group International Conference, 392-396. 
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Table 2.1. — Institution sampling rates by institutional stratum 



Institutional stratum 


Size of 
universe 1 


Sample 

size 2 


Sampling 

rate 


Total 


9,468 


973 


0.10 


Public less-than-2-year 


273 


39 


0.14 


Public 2-year 


1,265 


165 


0.13 


Public 4-year non-doctorate-granting 


376 


125 


0.33 


Public 4-year doctorate-granting 


243 


124 


0.51 


Private not-for-profit less-than-4-year 


902 


56 


0.06 


Private not-for-profit 4-year non-doctorate-granting 


1,306 


120 


0.09 


Private not-for-profit 4-year doctorate-granting 


681 


143 


0.21 


Private for-profit less-than-2-year 


3,516 


120 


0.03 


Private for-profit 2-year or more 


906 


81 


0.09 



'Based on the 1993-94 IPEDS IC file. 

inflated to account for ineligible and nonresponding sample institutions. 



SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1 996/200 1 ). 



Table 2.2. — Number of certainty and noncertainty institutions by institutional stratum 



Institutional stratum 


Sample institutions 


Number participating 


Total 


Certainty 


Non- 

certainty 


In 

NPSAS ^ 1 


With 

FTBs 2 


Total 


973 


131 


842 


836 


788 


Public less-than-2-year 


39 


10 


29 


31 


30 


Public 2-year 


165 


7 


158 


159 


156 


Public 4-year non-doctorate-granting 


125 


14 


111 


119 


114 


Public 4-year doctorate-granting 


124 


29 


95 


125 


120 


Private not-for-profit less-than-4-year 


56 


6 


50 


42 


36 


Private not-for-profit 4-year non-doctorate-granting 


120 


2 


118 


102 


98 


Private not-for-profit 4-year doctorate-granting 


143 


54 


89 


130 


114 


Private for-profit less-than-2-year 


120 


2 


118 


61 


59 


Private for-profit 2-year or more 


81 


7 


74 


67 


61 



'institution classifications used here were verified by the institutions to correct classification errors on the sampling frame. 
2 Some NPSAS:96 institutions had no FTB students. 



SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 

The objectives of this additional, implicit stratification were to ensure proportionate 
representation of institutions by level for the two strata that include institutions at two levels; to 
ensure proportionate representation of all geographic regions; and to ensure proportionate 
representation of small institutions. 

The effect of the implicit geographic stratification is seen in table 2.3, which shows that 
the geographic distribution of the sample is comparable to that of the survey population (the 
eligible institutions in the 1993-94 file). 
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Table 2.3.— Distribution of NPSAS:96 sample by region 



Region 


Sample institutions 


IPEDS universe 


Number 


Percent 


Number 


Percent 


Total 


973 


100.0 


9,468 


100.0 


1 . New England 


62 


6.4 


542 


5.7 


2. Mid East 


181 


18.6 


1,557 


16.4 


3. Great Lakes 


150 


15.4 


1,486 


15.7 


4. Plains 


70 


7.2 


801 


8.5 


5. Southeast 


194 


19.9 


2,105 


22.2 


6. Southwest 


89 


9.1 


878 


9.3 


7. Rocky Mountains 


34 


3.5 


322 


3.4 


8. Far West 


170 


17.5 


1,622 


17.1 


9. Outlying Areas 


23 


2.4 


155 


1.6 



Legend: 

1 = CT, ME, MA, NH, RI, VT 

2 = DE, DC, MD, NJ, NY, PA 

3 = IL, IN, MI, OH, WI 

4 = IA, KS, MN, MO, NE, ND, SD 

5 = AL, AR, FL, GA, KY, LA, MS, NC, SC, TN, VA, WV 



6 = AZ, NM, OK, TX 

7 = CO, ID, MT, UT, WY 

8 = AK, CA, HI, NV, OR, WA 

9 = PR 



SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 



b. NPSAS:96 Student Sample 



Each sample institution was asked to provide a database or hard-copy list of all its 
NPSAS-eligible students enrolled during the NPSAS year. Students were sampled on a flow 
basis as the student files and lists were received. Machine-readable lists were unduplicated by 
student ID number prior to sample selection. Stratified systematic sampling was used to 
facilitate sampling from both hard-copy and machine-readable lists. For each institution, the 
student sampling rates, rather than the student sample sizes, were held constant (fixed) for the 
following reasons: 

• to facilitate sampling students on a flow basis as student lists were 
received; 

• to facilitate the procedures used to “unduplicate” the sample selected from 
duplicated hard-copy lists; and 

• because sampling at a fixed rate based on the overall stratum sampling 
rate and the institutional probabilities of selection results in approximately 
equal overall probabilities of selection within the ultimate student strata. 

For each sample institution, the student sampling rates were determined for each of four 
student sampling strata: 

• potential FTBs, 

• other undergraduate students, 

• first professional students, and 

• other graduate students. 
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The institutions were asked to specify the student level (undergraduate, first professional, 
or other graduate student) based on the student’s last term of enrollment during the NPSAS year. 
Furthermore, they were asked to identify their undergraduate students whose first term of 
enrollment at the institution was during the NPSAS year, who were freshman or first-year 
students at that time, and who did not have any transfer credits from another postsecondary 
institution. Those students were classified as the potential FTBs. The sampling rates depended 
on the overall population sampling rates for the four types of students, the probability of 
selecting the institution, and a requirement for a minimum of 40 sample students per institution 
whenever possible. 

NPSAS:96 data collection consisted of computer-assisted data entry (CADE) from 
records maintained by the institutions (e.g., at the financial aid or registrar’s office) for all 
sample students as well as computer-assisted telephone interviews (CATI) with sample students. 
Unfortunately, a sample student’s FTB status could not be determined until the student’s CATI 
interview had been completed. Therefore, potential FTBs were oversampled in NPSAS in an 
attempt to yield a sufficient number of BPS-eligible sample members. A total of 12,410 cases 
were identified as either pure or effective FTBs and thus were eligible for the BPS:96 cohort. 

c. BPS:1 996/2001 Sample 

Of the 12,400 eligible for the BPS:96 cohort, 10,300 completed the BPS:96/98 interview 
and were verified to be FTBs. The BPS: 1996/2001 sample consisted of these BPS:96/98 
respondents plus almost 1,800 NPSAS:96 respondents (BPS:96/98 nonrespondents) who were 
verified to be FTBs. Excluding those cases identified as deceased since their last interview, 
almost 12,100 sample members eligible for BPS: 1996/2001. 

To contain costs for the full-scale study, the eligible BPS:96/98 nonrespondents were 
subsampled. A sample of BPS:96/98 nonrespondents with probabilities proportional to their 
initial weights was selected. Of these cases, a stratified random subsample was selected to 
include at the beginning of data collection. The remaining cases were reserved for possible 
fielding at a later date if necessary and not cost-prohibitive, but ultimately were not included in 
the BPS: 1 996/200 1 sample. The details of this sampling are described below. 

The first step entailed defining three nonrespondent subsampling strata based on whether 
the parent postcard was returned and whether the sample member either matched to the Central 
Processing System (CPS) database or Telematch produced a good telephone number. It was 
expected that sample members whose parents returned the postcard were most likely to be 
located and interviewed. Those whose parents did not return the postcard but who matched to 
CPS or Telematch were assumed to be somewhat less likely to be located and interviewed. 
Sample members whose parents did not return the postcard and who did not match to CPS or 
Telematch were assumed to be least likely to be found and interviewed. These three sampling 
strata were then subdivided based on institutional strata because FTBs were sampled at different 
rates at different types of institutions. Preliminary analyses showed that without this subdivision 
of the sampling strata, the unequal weighting design effects for institutional analysis strata 
become unacceptably large. A sample allocation was chosen that maximized the unweighted 
response rates and those rates were then scaled to achieve the desired sample sizes. The sampling 
strata and sampling rates, are shown in table 2.4. 
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Table 2.4. — Sampling rates for BPS:96/98 nonrespondents 



Stratum 


Mail/merge result 


Institutional stratum 


Sampling rate 


Total 








1 




1 or 2 


0.423 


2 




3 


0.265 


3 


Parent postcard 


4 


0.198 


4 


returned 


5 or 6 


0.312 


5 




7 


0.365 


6 




8 or 9 


0.388 


7 




1 or 2 


0.140 


8 




3 


0.066 


9 




4 


0.053 


10 


Matched CPS or 


5 


0.193 


11 


Telematch 


6 


0.083 


12 




7 


0.057 


13 




8 


0.155 


14 




9 


0.179 


15 




1 or 2 


0.170 


16 




3 


0.058 


17 




4 


0.068 


18 


Postcard not returned 


5 


0.145 


19 


and no match to CPS 


6 


0.069 


20 


or Telematch 


7 


0.059 


21 




8 


0.129 


22 




9 


0.160 



Legend for institutional stratum: 

1= public less-than-2-year 6= private not-for-profit 4-year non-doctorate-granting 

2= public 2-year 7= private not-for-profit less-than-2-year 

3= public 4-year non-doctorate-granting 8= private for-profit less-than-2-year 

4= public 4-year doctorate-granting 9= private for-profit 2-year or more 

5= private not-for-profit less-than-4-year 

NOTE: The CPS (Central Processing System) contains locating information for all sample members who applied for federal 
financial aid for a given year. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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Finally, a stratified sample of nonrespondents with probabilities proportional to their 
initial weights was selected, using the sample allocation computed in the previous step. A 
stratified random subsample of these cases was selected to include as part of the initial 
BPS: 1996/2001 sample. Due to the high cost of locating these sample members and their 
relatively low interview rates, the remaining nonrespondents were not added to the sample. 

3. BPS:1 996/2001 Field Cluster Selection 

Field interviewing, discussed in detail later in this chapter, required the selection of 
geographic clusters. These geographic clusters were selected at the start of data collection to 
maximize the likelihood of having a high number of sample members in each area. The 
geographic clusters were defined by the following multistep process: 

• First, a unique zip code was associated with each sample member, based on their “best 
address” available. The U.S. Postal Service’s address standardizing service was used to 
clean addresses and obtain zip codes for as many addresses as possible. 

• Next, RTFs geographic information system (GIS) was loaded with each sample 
member’s zip code. 

• Finally, the GIS plotted each zip code, identifying concentrations of sample members 
within 50-mile radii. 

This process resulted in 30 geographic clusters, each containing between 63 and 900 potential 
field cases. CATI nonrespondents were assigned to one of the 30 geographic clusters based on 
the latest tracing information available at the time that a sample member was identified for field 
interviewing. If the most recent locating information fell outside the 30 clusters, the case was 
treated as a “hard to reach” case (described below). 

B. Data Collection Design 

1. Instrument Development 

The BPS: 1996/2001 interviews were conducted using computer-assisted interviewing 
(CAI) technology to conduct both telephone and in-person interviews. In preparation for the 
development of the CATI/CAPI instrument, a comprehensive set of data elements was developed 
from a review of the data elements used for the BPS:90 cohort, their relationship to the 
NPSAS:96 and BPS:96/98 data elements, the reliability of responses obtained in BPS:90, and 
their relevance to current research and policy issues. To allow for cross-cohort comparisons 
with BPS:90/94, the data elements included retrospective information. 6 A preliminary set of 
BPS: 1996/2001 data elements was refined with input from the study’s Technical Review Panel 
(TRP; see appendix A for a list of members) as well as from NCES and other Department of 




6 Because BPS: 90/94 occurred in the fifth academic year and BPS: 1996/2001 occurred in the sixth 
academic year, retrospective information was collected to allow comparisons between the two cohorts. See 
figure 1.1 and the accompanying discussion in chapter 1 regarding the timing of the follow-ups of the two cohorts. 
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Education staff. The final set of data elements is presented in appendix B. 

Based on the data elements, the BPS: 1996/2001 CATI/CAPI instrument was first 
developed for the field test data collection effort and then, with feedback from NCES and 
recommendations from the TRP, revised for the full-scale data collection. The instrument was 
structured by identifying section topics and determining the progression of items within sections. 
Individual items were designed with several goals in mind: (1) using existing items (that have 
been previously tested) when feasible; (2) ensuring consistency with NPSAS:96, BPS:96/98, and 
BPS:90/94 items when items were not identical; and (3) identifying and preparing wording for 
item verifications and probes as necessary. Detailed instrument specifications were written for 
each item, including variable names and definitions, skip patterns, and out-of-range limits. 

Instrument sections were reviewed on a flow basis by NCES. As depicted in figure 2.1, 
the first section collected information about all postsecondary enrollment since the previous 
interview. 7 The next two sections collected information about undergraduate and post- 
baccalaureate (graduate or additional undergraduate) school experiences, respectively. 
Employment, particularly addressing rate of return policy issues, was the focus of the fourth 
section. This section asked about the first job after leaving school for those who were not asked 
about first job in the BPS:96/98 interview (because they were still enrolled), as well as current 
job information. The final section collected background and current status information such as 
family formation/household composition, income, debts, civic participation, disabilities, and 
goals. A facsimile interview is provided in appendix C. 

Despite different data collection methods, the CATI and CAPI interviews were 
programmed identically, using version 4.3 of the Computer-Assisted Survey Execution System 
(CASES) software. The CATI/CAPI system presented interviewers with screens of questions to 
be asked of the respondents, with the software guiding the interviewer and respondent through 
the interview. Inapplicable questions were automatically skipped based on prior response 
patterns and preloaded information. Wording for probes was suggested when a respondent 
provided a response that was out of range for a given item. Help text was provided for each 
screen in the event that clarification of question intent was required. Online coding programs for 
IPEDS, enrollment terms, major, financial aid, and occupation/industry were incorporated to 
allow standard coding of responses. 

Concurrent with the design and programming of the CATI/CAPI instrument, instrument 
documentation was entered into an integrated data dictionary system (DDS), which subsequently 
facilitated production of data files with CATI/CAPI variable documentation. An abbreviated 
instrument was developed for the purpose of interviewing special respondent groups such as 
sample members whose primary language is Spanish. The abbreviated instrument, also 
presented in appendix C, focused on the respondent’s postsecondary enrollment history, 
undergraduate experiences, employment, and family formation. 



7 The instrument consisted of sections B through F. The need for section A, Eligibility Determination, was 
eliminated as all sample members were either NPSAS:96 or BPS:96/98 respondents. 
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Figure 2.1. — Structure and flow of the BPS: 1996/2001 interview 





16 



Chapter 2: Design and Method 



To minimize the interview burden on respondents, the CATI/CAPI instrument used 
extant data whenever feasible. Preloaded values from the locator database and data from the 
NPSAS:96 and BPS:96/98 interviews were used to confirm the identity of sample members and 
to reduce data collection time, effort, and cost. The preloaded data dictated the flow of many 
portions of the interview. Certain questions were asked only if the data were missing from prior 
interviews. Other questions used the NPSAS.96 and BPS:96/98 preloads to provide context 
(e.g., “I'd like to begin by asking you some questions about your school enrollment since the last 
time we talked to you in 1998. According to our records, you were enrolled at North Carolina 
State University at that time. Are you still enrolled there?”). In other questions, respondents 
were asked to update information since the last interview based on preloaded information (e.g., 
“Last time we talked to you, your major or program of study while attending North Carolina 
State University was electrical engineering. Is that still your major?”). 

Once CATI/CAPI programming was completed, test cases were developed and loaded for 
instrument testing and interviewer training. Project staff systematically tested the CATI/CAPI 
instrument prior to the start of interviewer training. Finally, preload files containing data from 
NPSAS:96, BPS.96/98, and the Department of Education databases were prepared and loaded 
into the CATI/CAPI system to both guide the interview and assist sample member locating 
efforts. Data collection commenced only after all of these tasks were complete. 

2. Locating 

The BPS: 1996/2001 sample members were at a stage in their lives where they tended to 
be highly mobile, having moved at least once, if not multiple times, since they were last 
interviewed. Consequently, it was a difficult population to locate. The BPS: 1996/2001 design 
involved tracing sample members to their current location and conducting an interview by 
telephone (CATI) or in person (CAPI) with them about their experiences since their last 
interview (the BPS:96/98 interview 3 years earlier or the NPSAS:96 interview 5 years earlier). 
The locating activities, depicted in figure 2.2 and discussed in the following sections, involved 
advance locating conducted before the start of CATI, locating activities performed by telephone 
interviewers as part of CATI operations, intensive tracing by RTI’s Tracing Operations Unit 
(TOPS), and field locating. 

a. Advance Locating 

Locating information was collected during the NPSAS.96 and BPS:96/98 interviews and 
incorporated into the locator database. The locating information included the sample members’ 
local and permanent addresses and telephone numbers, the addresses and telephone numbers of 
parents and friends of sample members, drivers license information, and Social Security 
Numbers. These locating data were updated by the U.S. Postal Service National Change of 
Address (NCOA) and by Telematch operations, which provided updated address and telephone 
number information, respectively. Department of Motor Vehicle (DMV) searches were 
conducted in the six states containing the largest concentrations of sample members (California, 
Texas, Florida, New York, Illinois, and Michigan) to obtain additional locating information. 
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Figure 2.2. — BPS: 1996/2001 full-scale data collection: tracing activities 
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Figure 2.2. — BPS:1996/2001 full-scale data collection: tracing activities — 
Continued 
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Four months prior to the start of data collection, a mailing, consisting of a letter, a study 
leaflet, and an address update information sheet, was sent to the parents or other contacts of 
sample members to update the most recent sample member addresses and to gain cooperation by 
explaining the purposes of the study. A similar mailing, consisting of a letter, a study leaflet, a 
call-in card, and an address update information sheet (examples of each are in appendix D), was 
sent to sample members immediately prior to the start of data collection. The purpose of this 
mailing was to notify the sample members of the upcoming interview, inform them of their rights 
as participants, stress the importance of the study and urge their participation, and obtain 
additional postal service address updates. The mailing also gave sample members the 
opportunity to complete and return an address update form. New contact information obtained 
from the mailing was entered into the locator database. 

To expand efforts to gain parent cooperation, a postcard was mailed to the parents of 
sample members at the beginning of the data collection period, informing them of the upcoming 
data collection. The postcard consisted of a note explaining the study as well as a perforated card 
for the parent to tear off and give to the sample member (see appendix D). The card asked the 
sample member to call in using the toll-free telephone number shown and complete the interview 
at his or her convenience. This addressed a problem encountered in the field test and other 
NCES studies, namely, that parents sometimes acted as “gatekeepers” making it difficult to 
locate and speak with the sample member. 

Additional pre-CATI tracing was performed for sample members identified as BPS.96/98 
nonrespondents, those with insufficient telephone number information, and those for whom we 
received undeliverable mail returns through RTFs TOPS Unit. TOPS’s intensive tracing 
operations are described below. 

b. CATI Locating 

In addition to the advance locating activities described above, tracing efforts were 
undertaken by interviewers in the Telephone Survey Unit (TSU), concurrent with their efforts to 
gain cooperation from and interview sample members. When assigned a case, the telephone 
interviewer called the telephone number designated by the system as the best number (i.e. , the 
number among all available locator numbers that appeared to have the greatest potential for 
contacting the sample member) and attempted to interview the designated sample member. If 
the person at that number indicated that the sample member could not be reached there, the 
interviewer requested additional contact information for the sample member. If the person was 
unable to provide additional information, the interviewer called additional telephone numbers 
associated with the case in an attempt to locate the sample member. After all possible telephone 
numbers for the case were exhausted without success, the case was assigned to TOPS for 
intensive tracing. 
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c. Intensive Tracing 

Intensive tracing was performed by RTI’s TOPS unit, which had access to both 
proprietary and public domain data. TOPS tracers had real-time access to consumer databases 
that contained current address and phone listings for the majority of consumers with credit 
histories. In addition to proprietary databases, TOPS had access to various other information 
sources, such as Dataminers, commercial list-houses, and NCOA via leased line. These sources 
searched for name, address, neighbor, business, telephone number, and status (decedent, 
incapacitated, military). 

A two-tiered intensive tracing plan was used to locate sample members. The first tier 
involved identifying sample members with Social Security Numbers and processing that 
information through two credit bureau searches. If the searches generated a new telephone 
number, that case was returned to TSU for telephone interviewing. If a new address was 
generated but no telephone number was provided, tracers called directory assistance or queried 
other databases to obtain telephone numbers for CATI. This first level of effort minimized the 
time that cases were out of production. 

The more intensive second tier was implemented for those cases where the first level 
searches were unsuccessful. This involved the following tracing procedures: (1) checking 
directory assistance for telephone listings at various addresses; (2) using reverse-match databases 
to obtain the names and telephone numbers of neighbors and then calling the neighbors; 

(3) calling persons with the same unusual surname in small towns or rural areas to see if they 
were related to or knew the sample member; (4) contacting the current or last known residential 
sources such as the neighbors, landlords, current residents, tax assessors, realtors, and other 
business establishments related to previous addresses associated with the sample member; (5) 
calling colleges and military establishments to follow up on leads generated from other sources; 
and (6) checking various tracing Web sites. Tracers checked new leads produced by these 
tracing steps to confirm the address and telephone numbers for the sample members. When the 
information was confirmed, the case was returned to TSU for telephone interviewing. If the 
information could not be confirmed (e.g., there were no working telephone numbers or numbers 
for relevant neighborhood sources were unpublished), and the sample member was thought to be 
located in one of the geographic clusters, the case was assigned to field interviewers for locating. 

d. Field Locating 

Locating activities were performed by field interviewers, concurrent with their efforts to 
interview sample members. Since the costs of conducting field locating were high, field locating 
efforts were implemented only when less costly efforts were exhausted. Sample members were 
identified as needing field locating/interviewing if they were not located using CATI-locating 
and centralized intensive tracing. Additionally, sample members who were located by telephone 
but initially refused to participate were identified as potential field cases. 
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Thirty geographic clusters of sample members were identified and staffed with field 
interviewers. The interviewers were trained to locate and interview sample members using a 
laptop computer. Field interviewers were provided with a checklist which included sample 
questions to help with tracing operations and that demonstrated the correct order in which 
tracing activities should be performed. The checklist was completed for each case to help 
identify sources considered to be most useful in locating sample members. Field interviewers 
documented every telephone call or field contact. 

Primary tracing sources included: current or former neighbors, postsecondary schools 
attended, past or present employer, social agencies’ records, and city and county offices. 
Secondary tracing sources included directory assistance, chambers of commerce, public libraries, 
the U.S. Postal Service, and Departments of Motor Vehicles. Other miscellaneous sources, useful 
in some cases, included small town police or sheriffs departments, fire departments or 
emergency rescue squads, local newspapers, public housing authorities, mobile home park 
managers, motel staff, probation officers, and permit issuing departments at the city level (new 
construction). A contact script guided interviewers in soliciting information from various 
sources. 

3. Interviewing 

a. Training of Interviewers 

The training program for telephone and field interviewers was designed to maximize 
active participation. Training for telephone interviewers and their supervisors, conducted 
immediately prior to the start of telephone interviewing, consisted of a study overview, review of 
confidentiality requirements, demonstration interview, question-by-question review of the 
BPS: 1996/2001 instrument, and hands-on practice exercises with the instrument, tracing module, 
and online coding modules. Interviewers were also trained in techniques for gaining cooperation 
with sample members, parents, and other contacts, as well as techniques for addressing the 
concerns of reluctant participants and avoiding refusals. Training for field interviewers and their 
supervisors similarly consisted of lectures, demonstrations, and hands-on practice exercises with 
the instrument and online coding modules. In addition, field interviewers were trained on field- 
specific operations, including the field management system and field tracing procedures. The 
BPS: 1996/2001 telephone and field interviewer training agendas and the table of contents from 
their respective training manuals are located in appendix E. 

b. Telephone Interviewing 

CATI locating and interviewing began in February 2001 upon completion of telephone 
interviewer training. CATI procedures included attempts to locate, gain cooperation from, and 
interview study sample members by telephone. 

Locating information gleaned from the advance locating sources described above and 
from prior interviews with the sample member was preloaded into the CATI system. Each case 
had a call roster with names and telephone numbers associated with the sample member (e.g., 
parents, other contacts such as friends or relatives, sample member) for the interviewers to call. 
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Up to five roster-lines were preloaded with contact information. Additional roster-lines were 
added when CATI tracing or intensive tracing produced new contact information. 

An automated call-scheduler, embedded within the CATI software, assigned cases to 
interviewers. This system allowed calls to be scheduled on the basis of established case priority, 
time of day, and history of success of prior calls at different times and on different days. 
Scheduler case assignment was designed to maximize the likelihood of contacting and 
interviewing sample members. Cases were assigned to various queues for this purpose. Some of 
the queues included new cases, Spanish language cases, initial refusals, and various appointment 
queues (e.g., firm appointments set by the sample member, appointments suggested by locator 
sources, and appointments for cases which were initial refusals). 

Once located, some cases required special treatment. To deal with those who initially 
refused to participate (including locator sources who acted as “gatekeepers,” preventing access 
to the sample member), certain interviewers were trained in refusal conversion techniques. 
Sample members and their locator sources who spoke only Spanish, primarily located in Puerto 
Rico, were assigned to bilingual interviewers. 

Results of CATI interviewing were monitored daily through the study Integrated 
Management System. Daily reports of production, with revised projections of future production 
to satisfy study requirements, were available to both NCES and contractor staff. 

Finally, in an effort to increase study response rates, a modest incentive was used with 
particular types of nonrespondents: (1) cases where the sample member initially refused the 
interview, (2) sample members for whom intensive tracing yielded a good mailing address, but 
no telephone number, and (3) cases identified as “hard to reach” (i.e., those with 15 or more call 
attempts, where contact had been established with the sample member but no appointment could 
be scheduled). The subsample of BPS:96/98 nonrespondents was offered an incentive as well, 
although because subsample members were expected to be difficult cases, their incentive was 
offered before any attempt was made to interview them. The incentive mailing consisted of a 
letter from the project director tailored to the specific type of nonrespondent (i.e., refusal or no 
telephone number/hard to reach). A $5 bill was included with the letter. Respondents received a 
check for an additional $15 when they completed the interview. The incentive letters, shown in 
appendix D, were mailed on a flow basis as respondents met one of the criteria described above. 
All cases assigned to field interviewers were automatically eligible to receive the incentive. 

c. Field Interviewing 

Field interviewing activities began upon completion of interviewer training and 
assignment of field cases, approximately 12 weeks after the start of CATI interviewing. CAPI 
procedures included attempts to locate, gain cooperation from, and interview study sample 
members either by telephone or in person. The goal of the field interviewing effort was to 
increase the response rate by locating hard to reach sample members and by persuading reluctant 
sample members to complete the interview. Field interviewers were often successful in gaining 
cooperation where CATI failed to do so for a number of reasons: (1) a sample member using 
Caller ID to screen out calls from our CATI call center may have been more inclined to answer 
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the phone when the field interviewer’s local telephone number was displayed, (2) many of the 
field interviewers were more experienced in refusal conversion, and (3) sample members were 
less likely to refuse in person. 

All sample members who were finalized in CATI and by TOPS as “unlocatable” were 
eligible for assignment to the field for CAPI interviewing. Sample members who had not 
completed the BPS: 1996/2001 interview at the time field interviewing began and who resided in 
an identified geographic cluster were immediately assigned to a field interviewer. Field 
interviewers were provided with a detailed case history documenting all prior activity taken for 
the case. Nonrespondent cases not in a geographic cluster were sent for additional intensive 
tracing with RTI’s TOPS unit. An additional mailing was sent to the best address identified for 
the sample member, and the case incentivized as “hard to reach.” 

Upon successfully locating sample members, field interviewers attempted to complete the 
interview using the same instrument used by telephone survey personnel. The field staff were 
supported by a computerized control system that tracked field assignments and assigned 
interview status codes. Daily reports tracked the field effort. 

C. The Integrated Management System 

All aspects of the study were under the control of an Integrated Management System 
(IMS). The IMS was a comprehensive set of desktop tools designed to give project staff and 
NCES access to a centralized, easily accessible repository for project data and documents. The 
BPS IMS consisted of several components, or modules: the management module, the Receipt 
Control System (RCS) module, and the CATI/CAPI module. 

The management module of the IMS contained tools and strategies to assist the project 
staff and the NCES project officer in managing the study. All information pertinent to the study 
could be found here, accessible via the World Wide Web, in a secure desktop environment. 
Available on the IMS were the current project schedule, monthly progress reports, daily data 
collection reports and status reports (available through the Receipt Control System described 
below), project plans and specifications, key project information and deliverables, instrument 
specifications, staff contacts, the project bibliography, and a document archive. Also accessible 
from the management module was a downloadable version of the CATI/CAPI instrument for 
testing and review. 

The Receipt Control System (RCS) is an integrated set of systems that was used to 
monitor all activities related to data collection, including tracing and locating. Through the RCS, 
project staff were able to perform stage-specific activities, track case status closely, identify 
problems early, and implement solutions effectively. The RCS’s locator data were used for a 
number of daily tasks related to sample maintenance. Specifically, the mailout program 
produced mailings to parent/contacts and sample members, the query system enabled 
administrators to review the locator information and status for a particular case, and the mail 
return system enabled project staff to update the locator database as mailings or reply sheets 
were returned or forwarding information was received. 
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Another component of the RCS was the Field Case Management System (FCMS) which 
controlled field interviewing activities. The FCMS allowed field staff to conduct tracing and 
interviewing activities, communicate with RTI staff via electronic mail, transmit completed 
cases, and receive new cases. The RCS also interacted with the TOPS database sending locator 
data between the two systems as necessary. 

The C ATI/C API module managed development of the C ATI/C API instrument within the 
Data Dictionary System (DDS). The DDS consisted of a set of linked relational files and 
associated utilities for developing and documenting the instrument. Developing the CATI/CAPI 
instrument with the DDS ensured that all variables were linked to their item/screen wording and 
were thoroughly documented. Also included within the CATI/CAPI module was online coding 
software (“user exits”) that collected detail on schools attended, enrollment, major, financial aid, 
occupation, and industry. 

D. The Variable Tracking System 

The central mechanism for constructing input files for the electronic codebook (ECB) 
developed by NCES is a software application called the Variable Tracking System (VTS). The 
VTS tracks and stores documentation for both interview and derived variables required for the 
ECB and Data Analysis System (DAS). This includes weighted and unweighted variable 
distributions, variable labels and codes, value labels, and a text field describing the development 
of each variable and the programming code used to construct it. Input files for the ECB and 
DAS systems are automatically produced by the VTS according to NCES specifications. 
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Successful data collection for BPS: 1996/2001 involved several steps: updating existing 
locating information for the sample member, attempting contacts at the available addresses, 
initiating intensive locating efforts when contacts failed, and completing the interview. 

Chapter 3 describes these data collection outcomes and examines the effectiveness of our data 
collection methods. 

A. Response Rates 

Overall contacting and interviewing results for BPS: 1996/2001 are presented in 
figure 3.1. The starting sample consisted of those sample members who participated in the first 
follow-up, BPS:96/98, plus sample members selected from among the BPS:96/98 
nonrespondents. Of those cases, less than 1 percent was excluded because the sample members 
were deceased. Among the remaining cases, 92.4 percent were successfully contacted and 95.5 
percent interviewed, given contact, for an overall unweighted response rate of 88.3 percent. 
Weighted response rates are presented separately in chapter 6. 

Table 3.1 shows a distribution of response rates by type of interview completed and prior 
response status. From the table, 97 percent completed full interviews, while the remaining 
3 percent completed less than a full interview, either in the form of a partial interview (i.e., 
sample members completing at least the interview section on enrollment history) or an 
abbreviated interview (i.e., a condensed version of the full interview containing key data 
elements from each of the five sections of the survey). Partial and abbreviated interview 
response rates have been combined for reporting purposes. 



Table 3.1— Completeness of the BPS: 1996/2001 interview by BPS: 96/98 response status 



Type of response 


Total' 


BPS:96/98 response status 


Respondents 


Nonrespondents 


Total 


100.0 


99.3 


0.7 


Full interview 


96.8 


96.2 


0.7 


Partial/abbreviated interview 


3.2 


3.1 


<0.1 



1 Percents based on BPS : 1 996/200 1 respondents. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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Figure 3.1. — Locating, contacting, and interviewing outcomes 



Sample 

100 % 




Respondent 

95% 



▼ 


Full interview 


97% 


Partial interview 


1% 


Abbreviated interview 


2% 





Nonrespondents 

5% 





Refusal 


71% 


Unavailable 


21% 


Time ran out 


3% 


Institutionalized 


3% 


Incapacitated 


1% 


Hearing impaired 


<1% 



NOTE: Detail may not sum to totals because of rounding. 
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A comparison of conditional interview rates (i.e., interview given contact) in table 3.2 
shows that contacting and interviewing rates varied according to prior response status. The 
percentage of sample members who were interviewed, given contact, was 96 percent for those 
interviewed in both NPSAS:96 and BPS:96/98. A 90 percent response rate (given contact) 
resulted from those sample members who were only interviewed in BPS:96/98, while NPSAS.96- 
only respondents had a response rate of 81 percent. When compared, NPSAS:96 nonrespondents 
(BPS:96/98-only respondents) were easier to both contact (t = -2.3; p<.05) and interview (t = -2.2; 
p<.05) than those who responded during NPSAS:96 but not during BPS:96/98. 

Contacting and interview rates by type of school, presented in table 3.3, show the same 
general results as in the prior follow-up (BPS:96/98). That is, students who attended private for- 
profit schools continued to be difficult to contact and students from 4-year institutions tended to be 
relatively easy to contact. As in the prior follow-up, interviewing rates varied little by institution 
type, ranging from 92 to 97 percent once the person was contacted. 



Table 3.2.— BPS:1996/2001 contact and interview rates by prior response status 



Prior response status 


Total 


Contacted 1 


Interviewed, given contact 2 


Total 


100.0 


92.4 


95.5 


Interviewed in NPSAS:96 and BPS:96/98 


97.5 


92.6 


95.7 


Interviewed in NPSAS:96 only 


1.0 


80.8 


81.3 


Interviewed in BPS:96/98 only 


1.6 


91.9 


90.5 



1 Percentages are based on the total within the row under consideration. 

2 Percentages are based on the number contacted within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 



Table 3.3.— BPS:1996/2001 contact and interview rates by NPSAS:96 institutional sector 



Institutional sector 


Total 


Contacted 1 


Interviewed, given contact 2 


Total 


100.0 


92.4 


95.5 


Public less-than-2-year 


1.8 


92.8 


94.0 


Public 2-year 


12.2 


91.2 


94.8 


Public 4-year non-doctorate-granting 


19.9 


92.7 


96.0 


Public 4-year doctorate-granting 


24.0 


93.4 


96.5 


Private not-for-profit 2-year or less 


3.2 


88.4 


94.1 


Private not-for-profit 4-year non-doctorate-granting 


13.0 


93.1 


95.8 


Private not-for-profit 4-year doctorate-granting 


14.9 


95.8 


95.7 


Private for-profit less-than-2-year 


6.6 


85.5 


92.6 


Private for-profit 2-year or more 


4.6 


89.5 


93.6 



1 Percentages are based on the total within the row under consideration. 

2 Percentages are based on the number contacted within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 
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B. Respondent Locating and Interviewing Outcomes 

Tracing and locating sample members in any longitudinal study is a complex task, 
oftentimes requiring the use of multiple sources of information to locate the current address and 
telephone number of a sample member. Successful completion of the BPS: 1996/2001 locating 
effort required a combination of pre-CATI locating activities (advance tracing, updating the BPS 
locator database, mailings), telephone tracing during the CATI phase of data collection (tracing 
activities conducted by telephone interviewers/supervisors), centralized tracing efforts (tracing 
activities conducted by the TOPS unit), and tracing by field interviewers. Descriptions of these 
locating activities are presented in detail in chapter 2 and highlighted throughout this section. 

1. Locating 

Pre-CATI locating. An important first step in contacting and interviewing 
BPS: 1996/2001 sample members was the updating of address information collected during the 
BPS:96/98 and NPSAS:96 interviews, as well as any new information collected since the last 
interview. This new information could have been obtained from annual matches to the Central 
Processing System for federal financial aid applicants occurring as part of sample maintenance or 
from batch processing to the NCOA and Telematch databases. In addition, sample members’ 
parents and other individuals identified by the sample member in prior interviews were contacted 
by mail for address updates for the sample members. Address information was available for 
parents or other locators for 81 percent of the sample, and address update forms were received 
from 32 percent of those who were sent the mailing. 

One week before the start of CATI data collection, a second mailing was sent to inform 
sample members of the upcoming telephone interview and to request that they correct and return 
an address update sheet. The prenotification mailing was sent to every sample member with the 
exception of 38 cases for whom no address information was available. Address update sheets with 
new or confirmed information were received from 8 percent of those sent the mailing. As shown 
in table 3.4, contact (t = -13.9; p<.001) and interview (t = 16.0; p<.001) rates were higher for 
those respondents who returned an address update sheet or had it returned on their behalf. 



Table 3.4. — BPS:1996/2001 contact and interview rates by return of address update form 



Mailing 


Total 


Contacted 1 


Interviewed, given contact 2 


Total 


100.0 


92.4 


95.5 


Parent/other contact mailing 








Returned update form 


25.5 


97.9 


97.8 


Did not return update form 


74.5 


90.6 


94.6 


Sample member mailing 






99.3 


Returned update form 


8.0 


99.6 


Did not return update form 


92.0 


91.8 


95.1 



1 Percentages are based on the total within the row under consideration. 

2 Percentages are based on the number contacted within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 
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Intensive tracing. Intensive tracing was conducted by the TOPS unit at RTI both prior to 
data collection, for cases with no contact information at all (advance tracing), and during data 
collection, for cases where all leads were exhausted. A number of locating sources were used to 
trace sample members — including consumer databases, directory assistance, and Internet 
sources — in two tiers of tracing; the second, more intensive tier was used when the first failed to 
locate the sample member. Results of the intensive tracing effort are shown in table 3.5. 



Table 3.5. — BPS: 1996/2001 contact and interview rates by TOPS tracing stages 



Intensive tracing stages 


Total 


Contacted 1 


Interviewed, given contact 2 


Total cases traced 


100.0 


92.4 


95.5 


Pre-CATI tracing 








TOPS advance tracing (pre-CATI) 


1.3 


80.7 


84.4 


Advance tracing not required 


98.7 


92.6 


95.6 


CATI tracing 








TOPS tier 1 tracing 


30.6 


76.0 


92.5 


Tier 1 tracing not required 


69.5 


99.7 


96.5 


TOPS tier 2 tracing 


1.7 


44.4 


96.2 


Tier 2 tracing not required 


98.3 


93.3 


95.5 



1 Percentages are based on the total within the row under consideration. 

2 Percentages are based on the number contacted within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 

Advance tracing prior to the start of data collection was very successful. Of the cases 
traced, 81 percent was contacted and of those, 84 percent was interviewed. A total of 3 1 percent 
of sample members was traced using the first tier, resulting in 76 percent contacted and 93 percent 
(of those contacted) interviewed. The second tier tracing was implemented for 1 .7 percent of all 
cases, 44 percent of whom was contacted and 96 percent of those contacted interviewed. 

2. Refusal Conversion 

Refusal conversion procedures were used to gain cooperation from individuals who, over 
the course of data collection, refused to participate when contacted by telephone interviewers. 
Eighteen percent refused to be interviewed at some point during data collection and 74 percent of 
these refusals were successfully converted into completed interviews. 

The refusal rate and success of converting refusals varied according to the sample 
member’s response status on the previous interviews and type of school, as shown in tables 3.6 
and 3.7, respectively. As expected, initial refusal rates were lower (t = 4.7; p <.00 1 ) and refusal 
conversion rates higher (t = —4.7; p <.001) for those who had participated in both the NPSAS:96 
andBPS:96/98 interviews. 
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Table 3.6. — BPS:1996/2001 refusal and refusal conversion rates by prior response status 



Prior response status 


Total 


Ever refused BPS:1996/2001 
Interview 1 


Interviewed, given refusal 


Total 


100.0 


18.0 


73.9 


Interviewed in NPSAS:96 and BPS:96/98 


97.5 


17.7 


74.9 


Interviewed in NPSAS:96 only 


1.0 


32.3 


40.6 


Interviewed in BPS:96/98 only 


1.6 


26.9 


58.1 



1 Percentages are based on the total within the row under consideration. 

2 Percentages are based on the number ever refused within the row under consideration. 
Study: 1996/2001 (BPS: 1996/2001). 



Table 3.7. — BPS:1996/2001 refusal and refusal conversion rates by NPSAS:96 institutional 
sector 



Institutional sector 


Total 


Ever refused BPS:1996/2001 
Interview 1 


Interviewed, given 
refusal 2 


Total 


100.0 


18.0 


73.9 


Public less-than-2-year 


1.8 


24.9 


80.0 


Public 2-year 


12.2 


22.3 


71.3 


Public 4-year non-doctorate-granting 


19.9 


18.0 


76.8 


Public 4-year doctorate-granting 


24.0 


16.1 


76.4 


Private not-for-profit 2-year or less 


3.2 


21.4 


70.0* 


Private not-for-profit 4-year non-doctorate-granting 


13.0 


16.5 


74.7 


Private not-for-profit 4-year doctorate-granting 


14.9 


14.5 


75.8 


Private for-profit less-than-2-year 


6.6 


21.2 


64.6 


Private for-profit 2-year or more 


4.6 


22.4 


68.9 



'Percentages are based on the total within the row under consideration. 

Percentages are based on the number ever refused within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS:1996/2001). 



3. Field Interviewing 

Cases were selected for field interviewing if they could not be located in CATI or had been 
extensively worked in CATI but the subject could not be reached (e.g., calls always reached an 
answering machine). Only cases located in close geographic proximity to one of the 30 
geographic field clusters selected for BPS:1996/2001 were eligible for field interviewing. A total 
of 1 1.7 percent of cases were assigned to field interviewers. As shown in table 3.8, 80 percent of 
cases sent to the field was contacted, either in CATI or in the field, and 90 percent of those 
contacted was interviewed. Field interviewing rates by sector of the NPSAS:96 institution are 
presented in table 3.8. 





BEST COPY AVAILABLE 



Chapter 3: Data Collection Outcomes 



Table 3.8. — BPS:1 996/2001 field contact and interview rates by NPSAS:96 institutional 
sector 



Institutional sector 


Total 


Ever in field 1 


Contacted 2 


Interviewed, given 
contact 3 


Total 


100.0 


11.7 


79.8 


90.2 


Public 2-year or less 4 


14.0 


12.4 


82.1 


90.5 


Public 4-year non-doctorate-granting 


19.9 


9.2 


82.0 


87.1 


Public 4-year doctorate-granting 


24.0 


10.8 


80.1 


89.7 


Private not-for-profit 2-year or less 


3.2 


14.1 


76.1 


97.1 


Private not-for-profit 4-year non-doctorate-granting 


13.0 


10.6 


80.3 


90.4 


Private not-for-profit 4-year doctorate-granting 


14.9 


10.1 


84.5 


94.7 


Private for-profit less-than-2-year 


6.6 


21.1 


69.9 


86.0 


Private for-profit 2-year or more 


4.6 


19.4 


78.3 


91.7 



1 Percentages are based on the total within the row under consideration. 

2 Percentages are based on the number ever in field within the row under consideration. 

Percentages are based on the number contacted within the row under consideration. 

4 Public less-than-2-year and public 2-year sectors were combined due to the small number of cases. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study. 1996/2001 (BPS: 1996/2001). 

4. Nonresponse Incentive 

As discussed in chapter 2, incentives were offered to targeted sample members in order to 
encourage participation and help to compensate them for the time required to complete the 
interview, thereby reducing the number of nonrespondents. Response rates by nonrespondent type 
are provided in table 3.9 and by sector of the NPSAS:96 institution in table 3.10. The first group 
of potential nonrespondents to BPS: 1996/2001 included refusals that had not been converted, 
hard-to-reach sample members, and unbeatable sample members. All nonrespondents from 
BPS:96/98 were offered an incentive because they already had not responded (either refused or 
could not be located) in the prior interview. Interviews were completed with 72 percent of the 
incentivized cases. 



Table 3.9. — BPS:1996/2001 interview rates by incentive group type 



Incentive group 


Percent offered incentive 


Interviewed, given incentive 1 


Total 


100.0 


72.2 


BPS: 1996/2001 potential nonrespondent 


97.7 


72.2 


BPS:96/98 nonrespondent 


2.3 


69.9 



1 Percentages are based on the number offered incentive within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 
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Table 3.10. — BPS:1996/2001 incentive and interview rates by NPSAS:96 institutional sector 



Institutional sector 


Total 1 


Incentive offered 2 


Interviewed, given incentive 3 


Total 


100.0 


39.7 


72.2 


Public less- than-2 -year 


1.8 


44.2 


72.5 


Public 2-year 


12.2 


46.2 


72.3 


Public 4-year non-doctorate-granting 


19.9 


38.9 


73.4 


Public 4-year doctorate-granting 


24.0 


35.5 


73.8 


Private not-for-profit 2-year or less 


3.2 


49.5 


66.7 


Private not-for-profit 4-year non-doctorate-granting 


13.0 


36.5 


71.0 


Private not-for-profit 4-year doctorate-granting 


14.9 


33.7 


76.2 


Private for-profit less-than-2-year 


6.6 


51.2 


63.7 


Private for-profit 2-year or more 


4.6 


51.3 


71.2 



'Detail may not add to total because of rounding. 

Percentages are based on the total within the row under consideration. 

Percentages are based on the number of incentives offered within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 

C. Interview Burden and Effort 

This section reviews the burden and effort associated with the BPS: 1996/2001 interview. 
The first section examines the burden on respondents by examining the time required to complete 
the interview overall and by section. We then consider the effort required to locate and interview 
sample members for the study by considering the average time and number of calls that were 
required to complete interviews. 

1. Timing 

During instrument development, project staff embedded “time stamps” at the start and end 
of the interview, at the start of each section, and around each interview screen (which could 
include multiple, related items). The time stamps measured elapsed time, allowing project staff to 
monitor the time required to complete specific interview items, online coding programs, sections 
of the interview, and the entire interview. 

Average time to administer the BPS: 1996/2001 interview, overall and by section, is shown 
in table 3.11. Sections are listed in the table in the order in which they occurred in the interview. 
The number of cases completing each section fluctuated because some respondents broke off the 
interview early (partial interview); the timing figures for partial interviews are included through 
the end of the section prior to the point where the interview was terminated. In addition, sample 
members enrolled at the time of the interview who considered themselves to be primarily students 
(rather than employees) were skipped around the section on postenrollment employment. As a 
result, the number completing that section was low relative to the other sections. 

Average time by BPS:96/98 response status is presented in table 3.11. BPS:96/98 
nonrespondents were asked to provide data back to 1996, the time of the NPSAS base-year 
interview. Consequently, BPS:96/98 respondents took significantly less time than BPS:96/98 
nonrespondents to complete the 2001 interview (t = 3.9; p<.001). As shown in table 3.12, the 
shortest interview times can, in general, be attributed to those sample members who had no 
enrollment since their last interview (t = —43.6; p<.001). Those reporting no additional enrollment 
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skipped most of the section on enrollment history (section B), nearly all of the section on 
undergraduate enrollment (section C), and half of the section on postenrollment employment 
(section E), and took, on average, 1 1.9 minutes to administer, compared with 19.3 minutes for 
those who had been enrolled since their last interview. Likewise, the short interview times of 
students in less-than-2-year institutions, presented in table 3.13, can be attributed to their low 
enrollment rate since the last interview. 



Table 3.11. — Average elapsed minutes to complete the BPS:1996/2001 interview, by 
BPS:96/98 response status and by section 



Section 


Total 


BPS:96/98 

respondent 


BPS:96/98 

nonrespondent 


Average time 


Average time 


Average time 


Full interview 


17.8 


17.8 


21.4 


Enrollment history 


2.5 


2.5 


3.6 


Undergraduate enrollment 


4.3 


4.3 


4.9 


Graduate enrollment 


1.0 


1.0 


1.0 


Postenrollment employment 1 


7.1 


7.1 


7.9 


Background 


4.3 


4.3 


4.6 



1 Excludes respondents who skipped the postenrollment employment section because they were enrolled at the time of the interview 



and considered themselves to be primarily students. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 



Table 3.12. — Average elapsed minutes to complete the BPS:1996/2001 interview, by 
enrollment since previous interview and by section 



Section 


Total 


No enrollment since 
last interview 


Some enrollment since 
last interview 


Average time 


Average time 


Average time 


Full interview 


17.8 


11.9 


19.3 


Enrollment history 


2.5 


0.5 


3.0 


Undergraduate enrollment 


4.3 


0.5 


5.2 


Graduate enrollment 


1.0 


0.8 


1.0 


Postenrollment employment 1 


7.1 


5.2 


7.8 


Background 


4.3 


4.8 


4.2 



1 Excludes respondents who skipped the postenrollment employment section because they were enrolled at the time of the 
interview and considered themselves to be primarily students. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). ' 

2. Telephone Interviewer Hours 

A total of 15,291 telephone interviewer hours (exclusive of CATI training, supervision, 
monitoring, and quality circle meetings) were expended to obtain interviews from the sample 
members who completed full or partial CATI interviews. On average, telephone interviewers 
spent 1.91 hours per completed interview. With the average time to administer the interview at 18 
minutes, the large majority of interviewer time was spent in other activities, primarily attempting 
to locate and contact the sample member. 
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3. Number of Calls and Call Screening 

Number of calls. Tables 3.14 and 3.15 show the number of telephone calls made by 
telephone interviewers to sample members by BPS:96/98 response status and by institutional 
sector. Telephone interviewers made an average of 21 calls per sample member. 1 BPS:1996/2001 
respondents received an average of 19 calls, while nonrespondents averaged 34 calls over the 
course of the data collection period. 

Call screening. Interview nonresponse is an increasing problem for CATI and CAPI 
studies, affecting the cost of data collection and the quality of the resulting data. Call screening, 
defined as the use of devices such as telephone answering machines, Caller ID, call-blocking, or 
privacy managers to avoid unwanted telephone calls, is an increasing problem for all studies 
conducted by telephone. Call screening poses a significant obstacle to contacting sample members 
and can, in turn, affect the representativeness of the data, lower the response rate, and increase 
project costs by requiring additional call attempts and interviewer time. 

Approximately 40 percent of the telephone calls placed for BPS: 1996/2001 telephone 
interviewing reached an answering machine, and nearly three-quarters (74 percent) of the cases 
had at least one answering machine event. Considerably more calls were required to interview 
those with answering machines (average of 23 calls per case) than those without (average of eight 
calls per case; t = -29.4; p<.001). Similarly, cases with no answering machine events had a much 
lower rate of ever refusing (10 percent) compared to 20 percent with at least one answering 
machine contact (t = -1 1 .9; p<.001). 



1 These figures were captured by the study’s computerized receipt control system and are based on calls made 
by telephone interviewers. Calls made by TOPS (in attempt to locate sample members) and field interviewers are 
excluded. 
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Table 3.13. — Average elapsed minutes to complete the BPS:1 996/2001 interview, by section and by NPSAS:96 institutional 
sector 
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Table 3.14. — Number of calls made to sample members, by response status 



Category 


Average calls per case 


Percent of calls reaching 
answering machines 1 


Total 


21.1 


40.4 


BPS:1996/2001 response status 






Interviewed 


19.4 


40.3 


Not interviewed 


34.0 


41.0 


Prior response status 






Interviewed in NPSAS:96 and BPS: 96/98 


21.1 


40.4 


Interviewed in NPSAS:96 only 


22.4 


37.7 


Interviewed in BPS:96/98 only 


21.2 


43.7 



1 Percentages are based on the number of calls within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 



Table 3.15.— Number of calls made to sample members, by NPSAS:96 institutional sector 



Institutional sector 


Average calls per case 


Percent of calls reaching 
answering machines 1 


Total 


21.1 


40.4 


Public less-than-2-year 


19.6 


28.5 


Public 2-year 


22.1 


37.8 


Public 4-year non-doctorate-granting 


21.2 


41.9 


Public 4-year doctorate-granting 


21.4 


42.0 


Private not-for-profit 2-year or less 


21.7 


33.9 


Private not-for-profit 4-year non-doctorate-granting 


20.0 


42.5 


Private not-for-profit 4-year doctorate-granting 


19.4 


45.3 


Private for-profit less-than-2-year 


23.9 


31.5 


Private for-profit 2 -year or more 


22.3 


36.1 



1 Percentages are based on the number of calls within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 
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Chapter 4 
Evaluation of Data Quality 



Evaluations of data quality are effective in identifying problems with the instrument that 
can inform instrument design for future studies. Several types of evaluations were planned for 
BPS: 1996/2001 as part of the overall study design, including analyses of indeterminate 
responses, help text accesses, online coding, quality circle meetings, and quality control 
monitoring of interviews. These are described in the sections below. 

A. Indeterminate Responses 

Every item in the CATI/CAPI interview accommodated indeterminate responses — that is, 
“don’t know” and “refusal” responses — from sample members, recorded using the computer 
function keys F3 and F4. In general, refusal responses to interview questions tend to be common 
for items considered sensitive by the respondent, such as income and credit card debt in the 
BPS: 1996/2001 interview, while “don’t know” responses may be provided for a number of 
reasons, the most obvious reason being that the answer is truly unknown or in some way 
inappropriate for the respondent. Don’t know responses may also be evoked when (1) question 
wording is not understood by the respondent, (2) there is hesitancy on the part of the respondent 
to provide “best guess” responses, and (3) the respondent implicitly refuses to answer a question. 
Refusal and don’t know responses introduce indeterminacies in the data set. While the 
preference is to avoid indeterminate responses entirely, they must be resolved by imputation or 
other means during analysis following data collection. 

Overall item nonresponse rates were low, with only 9 of the 445 items in the interview 
containing over 10 percent missing data. These items are shown in table 4.1. Item nonresponse 
rates are calculated based on the number of sample members for whom the item was applicable 
and asked. 



Table 4.1. — Indeterminate response rates for items with more than 10 percent “don’t 
know” or “refused” 



Item description 


Percent don’t know 


Percent refused 


Combined Percent 


Cumulative undergraduate GPA 


10.6 


0.4 


11.0 


Lifetime Learning tax credit 1999 (undergraduate) 


11.4 


0.2 


11.7 


Lifetime Learning tax credit 1999 (graduate) 


11.7 


0.0 


11.7 


Gross annual salary for current job 


13.6 


6.2 


19.8 


Gross annual salary for first post-enrollment job 


20.4 


4.0 


24.4 


Gross salary for 2000 


7.8 


4.9 


12.8 


Spouse’s gross salary for 2000 


13.0 


7.5 


20.5 


Spouse’s monthly education loan payment 


10.2 


1.6 


11.8 


Total balance due on all credit cards 


7.1 


5.7 


12.9 



NOTE: Statistics are based on sample members for whom specific items were applicable and asked. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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Six of the items with high rates of combined nonresponse pertained to income and 
personal finances. Many respondents were reluctant to provide information about personal and 
family finances (refusals) and, among those who were not, many simply did not know. Grade 
point average also had more than 10 percent nonresponse, most likely because of respondents' 
difficulty in recalling this information as well as its sensitive nature. The other two items with 
more than 10 percent nonresponse asked about the Lifetime Learning tax credit. The high rate of 
“don’t know” responses for these items is likely due to respondents’ not knowing about the 
credit. 



The CATI/CAPI instrument was designed to convert “don’t know” responses, if possible, 
for three of these items. Sample members who responded with “don’t know” to the GPA item 
were asked to provide a letter grade range (e.g., mostly A's, A's and B's, mostly B's, etc.) instead 
of a number; their conversion rate was 94 percent for an item level response rate of 99 percent. 1 
When offered the opportunity to specify annual salary in terms of an hourly, weekly, twice 
monthly, or monthly amount, 91 percent of those who answered “don’t know” to the question of 
current annual salary, and 92 percent of those who answered “don’t know” to first 
postenrollment job salary, were converted, for an item level response rate of 93 and 94 percent, 
respectively. 

B. Help Text 

Online help text was available for every screen in the CATI/CAPI instrument. Help text 
screens included definitions of terms used in the question wording and the type of information 
requested. Having additional information available at the touch of a function key was beneficial 
to interviewers, particularly at the beginning of data collection, to immediately minimize 
respondents' confusion with questions while still on the telephone with a respondent. 

Counters were used to determine the number of times each help screen was accessed, 
making it possible to identify items that were confusing to the interviewer or respondent. An 
analysis of the number of help text accesses revealed only one item for which the rate of help 
text usage was greater than 4 percent: “ Do you (or your spouse) currently receive any of the 
following forms of untaxed income? TANF (AFDC), Social Security benefits, workers 
compensation, disability payments, child support, food stamps. ” It is likely that respondents 
were unfamiliar with some of these sources of untaxed income. 

C. Online Coding 

The BPS instrument included tools that allowed computer-assisted online coding of 
literal responses for postsecondary institution, major field of study, occupation, and industry. 
Online coding systems were designed to improve data quality by capitalizing on the availability 
of the respondent to clarify coding choices at the time the coding was performed. To assist with 
the online coding process, interviewers were trained to use effective probing techniques and 
given extensive, supervised practice. While the interview was being conducted, interviewers 
were able to clarify the text string provided and request additional information if it could not be 



1 Conversion of “don’t know” responses to the GPA item was not attempted in the abbreviated interview 
(19 “don’t know” cases). The response rate after conversion, inclusive of abbreviated cases, is 98.8 percent. 
Excluding abbreviated cases results in a response rate of 99. 1 percent. 
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coded on the first attempt. Because both the literal string and selected code were captured in the 
data file for field of study, occupation, and industry responses, subsequent quality control 
recoding by a coding expert was easily incorporated into data collection procedures. 

Institution coding was used to assign a six-digit Integrated Postsecondary Education Data 
System (IPEDS) identifier for each postsecondary institution the respondent reported attending, 
other than those collected during the base year and first follow-up interviews. To facilitate 
coding, the IPEDS coding system asked for the state in which the school was located, followed 
by the city, and finally the name of the postsecondary institution. The system relied on a look-up 
table, or coding dictionary, of institutions constructed from the IPEDS institutional database. 
Additional information in the dictionary, such as institutional level and control, was retrieved for 
later use (e.g., branching) once an institution was properly coded. 

Major field of study, occupation, and industry coding utilized a dictionary of word/code 
associations. The online procedures for these coding operations consisted of four steps: (1) the 
interviewer keyed the verbatim text provided by the respondent; (2) the dictionary system 
displayed similar words for those words in the text string that were not in the dictionary, giving 
the interviewer the option of accepting a word that would help in terms of coding or ignoring a 
word that was not applicable; (3) standard descriptors associated with identified codes were 
displayed for the interviewer; and (4) the interviewer selected the appropriate standard descriptor 
from the list, with the assistance of the respondent as needed. 

Several steps were taken after data collection to ensure the completion and accuracy of 
the online coding procedures. The first step was upcoding, where project staff reviewed IPEDS 
schools, majors, occupations, and industries that interviewers marked as “uncodeable” and coded 
the strings into the appropriate categories, where possible. Table 4.2 presents the proportion of 
coding attempts that were uncodeable by interviewers but were subsequently coded by project 
staff. 



Table 4.2. — Success of online coding procedures: upcoding 



Type of coding 


Total number 
of responses 
coded 1 


Number 

originally 

uncodeable 


Number 

upcoded 


Percent 

upcoded 2 


Final number 
of uncodeable 


Percent 

uncodeable 3 


Total 


19,434 


528 


231 


43.8 


297 


1.5 


Institution/IPEDS 


2,611 


451 


180 


39.9 


271 


10.4 


Major field of study 


2,518 


16 


10 


62.5 


6 


0.2 


Occupation 


6,469 


57 


38 


66.7 


19 


0.3 


Industry 


7,836 


4 


3 


75.0 


1 


<0.1 



'Total number of coding attempts may exceed the total number of completed interviews as some items were asked multiple times 
in an interview. 

Percentages are based on the number originally uncodeable within the row under consideration. 

Percentages are based on the total number of responses coded within the row under consideration. 

NOTE: IPEDS schools, majors, occupations, and industries were reviewed by project staff and either determined to be uncodable 
or “upcoded” into one of the existing response categories as appropriate. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001), 




41 



57 



Chapter 4: Evaluation of Data Quality 



Institution coding had the highest rate of uncodeable responses prior to upcoding as well 
as the lowest rate of successful upcoding. This is due, in large part, to the different methods used 
in coding: IPEDS coding required a precise match between the name of the institution entered 
and the IPEDS database, while major, industry, and occupation were coded by assigning 
verbatim strings to categories. Thus, while major, occupation, and industry strings were simply 
coded into categories, institutions required an exact match. Two additional factors contributed to 
this high rate of uncodeable schools: (1) the 1997-98 IPEDS database was used for upcoding, 
and, while this was the most recent version available, it did not include the newest schools; and 
(2) foreign institutions were not included in the IPEDS database and thus were not codeable 
either online or during post-data collection coding procedures. 

Of the remaining codeable fields, very few literal strings given by respondents were 
uncodeable. Major, occupation, and industry each had less than 1 percent originally uncodeable. 
Project staff were successful in upcoding the majority of these initially uncodeable strings. 

The second step to ensuring data quality was the recoding process. Ten percent of the 
cases were randomly sampled and their major, occupation, and industry coding results examined. 
The verbatim strings were evaluated for completeness and for the appropriateness of the assigned 
codes. Upon review of the string and assigned code, project staff determined whether a different 
code should be assigned. Table 4.3 shows the results of online coding procedures. Across the 
entire interview, verbatim strings were recoded for 8 percent of the coding attempts sampled, 
excluding those which could be upcoded (Table 4.2). The percentage recoded for occupation 
was higher than expected at 13 percent. Project staff noted that some occupation categories (e.g., 
“managers - midlevel” and “managers - supervisory”) were difficult to distinguish. Adding 
more examples to the descriptions of occupational categories that appear on screen may help to 
avert this problem in the future. 



Table 4.3.— Success of online coding procedures: recoding 



Type of coding 


Coding attempts 
sampled 


Number of sampled 
cases recoded 


Percent of sampled 
cases recoded 1 


Total 


1,703 


139 


8.2 


Major field of study 


243 


8 


3.3 


Occupation 


802 


101 


12.6 


Industry 


658 


30 


4.6 



'Percentages are based on the coding attempts sampled within the row under consideration. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 



D. CATI Quality Circle Meetings 

Quality circle meetings were a vital component in ensuring data quality and consistency 
throughout the data collection period. During these regularly scheduled meetings, interviewers, 
supervisors, and project staff met to discuss issues pertinent to conducting CATI interviews in 
the most effective manner. Telephone interviewers attended the quality circle meetings on a 
rotating basis. Helpful tips and summaries of discussions and decisions were prepared and 
distributed by project staff to all telephone interviewers and their supervisors. Meeting minutes 
were available both online and in hard copy. 
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The quality circle meetings were instrumental in providing prompt and precise solutions 
to problems encountered by the interviewers. Some slight modifications were made to the CATI 
instrument as a result of these meetings. Examples of issues raised in quality circle meetings 
included: 

Revising help text. Help text was modified based on telephone interviewer feedback. 
Modifications included clarification of definitions and additional information to aid interviewers 
in coding. 

Reviewing/entering case-level comments. The importance of reviewing and entering 
comments pertaining to contacting attempts for each sample member was stressed throughout 
data collection. Telephone interviewers were encouraged to always check the record of calls to 
see what happened previously on a particular case. This enabled interviewers to contact the 
respondent at the appropriate time and telephone number. By entering effective comments, a 
detailed description of events was created that proved helpful to interviewers who later accessed 
the case. 

Problem sheets. Problem sheets were a means for interviewers to report instrument or 
interviewing problems. Project staff reviewed these problem sheets in order to determine the 
nature of the problems encountered and resolve them accordingly. Solutions to recurrent 
problems were addressed in quality circle meetings and in the minutes of these meetings. 

Coding. Considerable emphasis was placed on properly coding responses. Since most 
respondents did not provide verbatim responses that exactly matched our response categories, 
telephone interviewers were instructed on how to fit those responses into the “best” possible 
category. In addition, telephone interviewers and project staff discussed solutions for how best 
to code items using the online coding system. 

Changes to the instrument. Telephone interviewers were notified if a change in 
programming code had to be made to fix a problem with the instrument or supporting screens. 

E. Quality Control Monitoring 

Monitoring telephone data collection serves a number of goals, all aimed at maintaining a 
high level of data quality. Monitoring in BPS: 1996/2001 helped to meet three important quality 
objectives: (1) reduction in the number of interviewer errors, (2) improvement in interviewer 
performance by reinforcement of good interviewing practices, and (3) assessment of the quality 
of the data being collected. 

In order to ensure data quality, CATI interviews were evaluated by supervisors using a 
silent quantitative monitoring system. Monitors listened to and simultaneously viewed the 
progress of the interview using remote monitoring telephone and computer equipment. Monitors 
listened to up to 20 questions during an ongoing interview and, for each question, evaluated two 
aspects of the interviewer-respondent interchange: whether the interviewer (1) delivered the 
question correctly (error in delivery) and (2) keyed the appropriate response (error in data entry). 
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Measures of question delivery and data entry were developed and daily, weekly, and 
cumulative reports produced. Monitoring took place during the first 3 1 weeks of data collection, 
with a total of 19,962 items monitored during that time. After the 12th week of data collection, 
monitoring efforts were scaled back due to the lighter caseload being worked by telephone 
interviewers, the greater experience of the remaining interviewers, and the satisfaction by project 
staff that the process was proceeding smoothly. Figure 4.1 shows error rates for question 
delivery; figure 4.2 shows error rates for data entry. 2 Both graphs provide upper and lower 
control limits for these measures. 3 The first two weeks reflect the learning curve expected at the 
start of any study during which interviewers are developing their skills with the instrument. 
During this time, error rates of up to 2.4 percent were recorded. Throughout the remainder of the 
monitoring period, error rates remained within acceptable limits, never exceeding 0.6 percent. 



2 Weeks 13 through 31 are not shown in the figures due to the low rate of monitoring. Analysis of 
interviewer behavior based on the few observations from this period is not useful. No errors were recorded after 
week 1 1 . 

3 The upper and lower control limits were defined by three times the standard error of the proportion of 
errors to the number of questions observed for the period (upper control limit: +3 times the standard error; lower 
control limit: -3 times the standard error). 
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Figure 4.1--Monitoring error rates for CATI question delivery 




NOTE: The upper and lower control limits were defined by three times the standard error of the proportion of errors to the number of 
questions observed for the period (upper control limit: +3 times the standard error; lower control limit: -3 times the standard error). 
Weeks 1 3 through 3 1 are not shown in the figures due to the low rate of monitoring. Analysis of interviewer behavior based on the 
few observations from this period is not useful. No errors were recorded after week 1 1 . 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 



Figure 4.2--Monitoring error rates for CATI data entry 
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NOTE: The upper and lower control limits were defined by three times the standard error of the proportion of errors to the number of 
questions observed for the period (upper control limit: +3 times the standard error; lower control limit: -3 times the standard error). 
Weeks 1 3 through 3 1 are not shown in the figures due to the low rate of monitoring. Analysis of interviewer behavior based on the 
few observations from this period is not useful. No errors were recorded after week 1 1 . 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 
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As the third of three interviews with the BPS:96 cohort, the data files for BPS: 1996/2001 
contain a number of component data files from a variety of sources in addition to those files 
created from the interview itself. These files are available as a set of restricted research files, 
fully documented by an Electronic Codebook (ECB), and as a public release Data Analysis 
System (DAS), which also contains full documentation. 1 This chapter describes each data file 
and details the documentation process. 

A. Overview of the BPS: 1996/2001 Data Files 

Data obtained from the BPS:1996/2001 student interview are contained in restricted data 
files, documented by an ECB, which are available to researchers who have applied for and 
received authorization from NCES to access restricted research files. Included in the 
BPS: 1996/2001 restricted data are the data files and ECB documentation for eligible first-time 
beginning students (FTBs) interviewed during the base year interview, NPSAS:96, and for the 
first follow-up interview, BPS:96/98. 

The BPS: 1996/2001 data contain the following files: 

2001 Derived Variables File — Contains the composite (derived) variables developed for 
use on the BPS: 1996/2001 public release DAS. [F01DERI.DAT] 

2001 CA TI Student Data File — Provides student-level raw data collected from the 
sample members who responded to the BPS: 1996/2001 interview. The file excludes any 
CATI “verbatim” variables, which are on the Verbatim Data File described below. 
[F01STUD.DAT] 

2001 CATI School Data File — Provides institution data obtained from the student 
interview. Although this is a student-level file, a sample member may have more than 
one record in the file since there is a separate record for each postsecondary institution 
attended since the last interview (up to nine institutions). [F01SCH.DAT] 

2001 Coding Results File — Contains the verbatim text and resulting codes for 
undergraduate major and, for employed respondents, industry and occupation. 
[F01CODE.DAT] 



1 The ECB and DAS are both fully documented software products available from the National Center for 
Education Statistics (NCES). 
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2001 Verbatim Data File — Contains item-level records (i.e., one record per variable) for 
text variables collected in CATI. It is possible to have multiple records per student or no 
records for a student. [F01VERB.DAT] 

2001 CATI Preload File — Contains those data preloaded from earlier data collections for 
use during the 2001 student interview for the CATI respondents. [F01PREL.DAT] 

2001 CPS Data Files — Contains data received from matches to the Central Processing 
System (CPS) database for each consecutive academic year (AY) since the last follow- 
up, as follows: 

• AY 1998-99: 5,064 BPS:96 sample members matched [CPS99.DAT] 

• AY 1999-2000 : 3,277 BPS:96 sample members matched [CPS00.DAT] 

• AY 2000-01: 2,131 BPS:96 sample members matched [CPS01.DAT] 

NSLDS Pell Data File — Contains raw grant-level data received from the National 
Student Loan Data System (NSLDS) for the 5,258 BPS sample members who received 
Pell Grants between May 1, 1995, and July 1, 2001. Provides a history file with separate 
records for each transaction in the Pell system. [PELL.DAT] 

NSLDS Loans Data File — Contains raw loan-level data received from the NSLDS for 
the 7,475 BPS sample members who received loans between May 1, 1995, and July 1, 
2001 . Provides a history file with separate records for each transaction in the loan files. 
[NSLDS.DAT] 

Weights File — Contains six weights with a separate record for each respondent. A set of 
analysis and Balanced Repeated Replication (BRR) weights are provided for cross- 
sectional analyses of the BPS: 1996/2001 population. In addition, a set of longitudinal 
analysis and BRR weights are provided (1) for respondents in all three interviews (base 
year and first and second follow-ups) and (2) respondents in the base year and second 
follow-up. [F01WGT.DAT] 

NPSAS:96 Files — Contains the restricted data file created for the base year interview, 
NPSAS:96. Provides the derived variables and interview data (including the verbatim 
and school files), plus data from NSLDS and CPS file matches. Cumulative enrollment 
data by school, updated during the BPS: 1996/2001 interview, have been deleted from the 
file. [FILE196.DAT, CODE96.DAT, SCATI96.DAT, PCATI96.DAT, INST96.DAT, 
CADEV96.DAT, CATIV96.DAT, SAT. DAT, ACT. DAT, AP.DAT, PSS.DAT, 

CCD. DAT] 

BPS:96/98 Files — Contains the restricted data file created for the first follow-up 
interview of the BPS:96 cohort, BPS:96/98. Provides the derived variables and interview 
data (including verbatim strings and school file), plus data from NSLDS and CPS file 
matches. Cumulative enrollment data by school, updated during the BPS: 1996/2001 
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interview, have been deleted from the file. [FILE198.DAT, F98STUD.DAT, 
F98PREL.DAT, F98METH.DAT, F98SCH.DAT, F98ALPH.DAT, CPS98.DAT] 



B. Data Coding and Editing 

The BPS: 1 996/2001 data were coded and edited using procedures developed and 
implemented for previous NCES-sponsored studies. These coding and editing procedures were 
refined during the field test for use in the processing of BPS: 1996/2001 full-scale data. 

The coding and editing procedures fell into two categories: 

• online coding and editing performed during data collection, and 

• post-data-collection editing. 

1. Online Coding and Editing 

BPS: 1996/2001 included one major data collection system — the CATI/CAPI interview — 
designed to include edit checks to ensure that the data collected were within valid ranges. To the 
extent feasible, the system incorporated across-item consistency edits. The CATI system also 
included online coding systems used for the collection of industry, occupation, and major, as 
well as a coding module used to obtain IPEDS information for new postsecondary institutions 
attended since the last interview. 

2. Post-Data-Collection Editing 

During and following data collection, the CATI/CAPI data were reviewed to confirm that 
the data collected reflected the intended skip-pattern relationships. At the conclusion of data 
collection, special codes were inserted in the database to reflect the different types of missing 
data. There are a variety of explanations for missing data within individual data elements. 

Table 5.1 lists the set of special codes used to assist analysts in understanding the nature of 
missing data associated with BPS: 1996/2001 data elements. 



Table 5.1. — Description of missing data codes 



Missing data code 


Description 


-1 


Don’t know 


-2 


Refused 


-3 


Legitimate skip (item was intentionally not collected because variable was not applicable 
to this student) 


-6 


Bad data, out of range, uncodeable userexit string 


-7 


Item was not administered (abbreviated interview) or reached (partial interview) 


-8 


Item was not reached due to a CATI error 


-9 


Missing for unknown reasons 



SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS:1996/2001). 
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Skip-pattern relationships in the database were examined by systematically running 
cross-tabulations between gate items and their associated nested items. In many instances, gate- 
nest relationships had multiple levels within the CATI/CAPI instrument. That is, items nested 
within a gate question may themselves have been gate items for additional items. Therefore, 
validating the gate-nest relationships often required significant iterations and multiway cross- 
tabulations. In some instances, additional across-item consistency checks were performed, 
although these checks were kept to a minimum since, without recontacting respondents, it was 
difficult to know which data item was the true source of the inconsistency. 

The data cleaning and editing process for the BPS: 1996/2001 consisted of the following 

steps: 



Step 1. Review of one-way frequencies for every variable to confirm no missing or blank 
values. This involved replacing blank or missing data with -9 for all variables in the 
CATI database and examining frequencies for reasonableness of data values. 

Step 2. Review of two-way cross-tabulations between each gate-nest combination of 
variables to check data consistency. This step required using CATI/CAPI source code as 
specifications to define all gate-nest relationships and replace -9 codes (missing for 
unknown reason code) with -3 codes (legitimate skip code) as appropriate. Where the 
two-way cross-tabulations revealed either unusually high numbers of nonreplaced -9 
codes, or unusually high numbers of responses for items which should have been 
skipped, the situation was investigated to ensure skip-pattern integrity. In some instances 
the inconsistency was due to a backup in the interview that changed the value of the gate 
question. In other cases resolution involved reprogramming the gate-nest relationship to 
be consistent with the CATI instrument. In rare instances this check revealed errors in 
the CATI source code. 

Step 3. Identify and specially code items that were not administered due to a partial or 
abbreviated interview. This step involved replacing -9 and -3 values with -7 (item not 
administered) based on the section completion and abbreviated interview indicators. This 
-7 code, which was used for the first time in BPS:96/98, allows analysts to easily 
distinguish those items that were not administered to the respondent due to a partial or 
abbreviated interview from items skipped or left blank unintentionally. 

Step 4. Identify items requiring recoding and logical imputations. Standard variable 
recoding and formatting (e.g., formatting dates as YYYYMM) and standardizing units of 
time (where an item collected amount of time in a variety of units) were performed 
during this step. Logical imputations were performed where items were missing but 
their values could be implicitly determined. For instance, if the respondent did not work 
in 2000, then the amount earned was imputed to $0 rather than -3 or -9. Items that were 
skipped because the respondent did not answer the gate question (don’t know or refusal) 
were imputed to the value of the gate question (-1 or -2). 

Step 5. Identify out-of-range or outlier values. One-way frequencies for all categorical 
variables and descriptive statistics for all continuous variables were examined. Values 
determined to be out-of-range or unreasonable were replaced with -6. 
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Concurrent with the data cleaning process, detailed documentation was developed to 
describe question text, response options, logical imputations, recoding, and the “applies to” text 
for each delivered variable. 



C. BPS: 1996/2001 Descriptive Report 

The descriptive report, a separate BPS: 1996/2001 publication, documents some of the 
significant results from the longitudinal data collection. It includes an essay on persistence and 
attainment at 4-year institutions and a table compendium updating key variables for student 
characteristics, education and employment experiences, finances, and civic participation created 
using the BPS: 1996/2001 DAS. 
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Development of statistical analysis weights for the BPS: 1996/2001 sample is discussed in 
section A below. Cross-sectional weights were constructed for analyzing the respondents to 
BPS: 1996/2001. In addition, two longitudinal weights were constructed, one for analyzing the 
students who participated in all three interviews — NPSAS:96, BPS:96/98, and BPS: 1996/2001 — 
and another for analyzing the students who responded to NPSAS:96 and BPS:1996/2001. 
Analysis procedures that can be used to produce design-unbiased estimates of sampling 
variances are discussed in section B, including variances computed using Taylor series and 
balanced repeated replications (BRR) techniques. Section C discusses the accuracy of 
BPS: 1996/2001 estimates in terms of both precision and potential for bias. This section includes 
survey design effect tables that illustrate the level of precision achieved by the BPS: 1996/2001 
survey for key analytic outcomes for several important analysis domains. Finally, section D 
gives weighted response rates. 

A. Analysis Weights 

The initial file for the BPS: 1996/2001 sample contained approximately 

• 10,300 BPS:96/98 respondents and 

• 1,800 BPS: 96/98 nonrespondents. 

As noted in chapter 2, the final BPS: 1996/2001 sample consisted of 

• the eligible respondents to BPS:96/98 and 

• a subsample of nonrespondents to BPS:96/98 who were NPSAS:96 respondents. 

Among these, over 20 were identified as deceased either prior to data collection and after data 
collection began. 

A statistical analysis weight was computed to be used for analyzing data from the 
BPS:1996/2001 respondents. In addition, two longitudinal weights were computed: a weight for 
analyzing those BPS: 1996/2001 respondents who also responded to NPSAS:96 and BPS:96/98, 
and a weight for analyzing the BPS: 1996/2001 respondents who only responded to NPSAS:96 
and BPS: 1996/2001. 

The weights for the BPS: 96/98 respondents were constructed by applying a series of 
adjustments for subsampling and nonresponse to the base weights for the 2001 follow-up of the 
BPS:96 cohort, namely B01IAWT 1 . Specifically, four adjustments were made: 



1 The rationale for the variable name “B01IAWT” is the following: 
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• to account for subsampling of the BPS:96/98 nonrespondents; 

• to account for those not located; . 

• to account for refusals, among those who were located; and 

• to account for types of nonresponse other than refusals among those who were 
located and did not refuse. 

Construction of the longitudinal weight for those who responded to all three surveys consisted of 
an additional adjustment for nonresponse to either NPSAS:96 or BPS:96/98. Construction of the 
analysis weight for those who responded to both NPSAS:96 and BPS: 1996/2001, but not to 
BPS:96/98, consisted of an additional adjustment for nonresponse to NPSAS:96. 

1 . Base Weight for BPS:1996/2001— Adjustment for Subsampling of 
BPS:96/98 Nonrespondents 

As discussed in chapter 2, a subsample of BPS:96/98 nonrespondents was included in 
BPS: 1996/2001 . The subsample, rather than all nonrespondents, was fielded in order to reduce 
data collection costs. The weight B01LAWT was adjusted for those students, j, in the subsample 
by multiplying by the inverse of their selection probabilities. These probabilities take into 
account the stratification and probability proportional to size (PPS) sampling that was used in 
selecting the subsample. The adjustment was 

ADJlj = l/jtj. 

The weight was calculated as: 

B01_100U = B01IAWT * ADJ1, for students in the BPS:96/98 nonrespondent 

subsample 

= B01IAWT for all other students. 

The weights B01100U for the students in the subsample were then adjusted so that they 
summed to the weight sum of B01IAWT for the BPS:96/98 nonrespondents. This adjustment 
resulted in the initial sampling weight for the BPS: 1996/2001 sample, which is denoted 
B01_100. B01100 was further adjusted to produce the BPS:1996/2001 analysis weights, as 
described below. 

The weight B01100 is nonnegative for both the eligible and ineligible (i.e., deceased) 
students. Weighted response rate tables later in this chapter were computed using B01_100 and 
were based on the set of eligible students. The eligible students are those with B01ELIG=1 
where B01ELIG is the eligibility indicator for BPS: 1996/2001. 



B denotes the BPS survey 

0 1 denotes the year 200 1 

I stands for “initial” 

A stands for “analysis” 

WT stands for “weight” 
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2. BPS:1996/2001 Cross-Sectional Weights 

Analysis weights were constructed for the respondents to BPS: 1996/2001 . The weights 
were constructed by applying adjustments to the base weight B01_100. This section describes 
each of the adjustment steps, the variables considered for the adjustments, and the variables in 
the final weight adjustment models. 

The adjustment for nonresponse was performed in three steps because the predictors of 
response propensity were potentially different for each of the following outcomes: 

• inability to locate the student, 

• refusal to be interviewed, and 

• other noninterview. 

Using these three steps of nonresponse adjustment achieved greater reduction in nonresponse 
bias to the extent that different variables were significant predictors of nonresponse propensity at 
each step. 

All nonresponse adjustments were fit using RTI’s proprietary generalized exponential 
modeling procedure (GEM 2 ), which is similar to logistic modeling using bounds for adjustment 
factors. A key feature and advantage of the GEM software is that the nonresponse adjustment 
and weight trimming and smoothing are all accomplished in one step. Lower and upper bounds 
are set on the weight adjustment factors. The bounds can be varied, depending on whether the 
weight falls inside or outside a range, such as one defined by the bounds (median — 3 times the 
interquartile range, median + 3 times the interquartile range). This allows different bounds to be 
set for adjustments for weights that are considered high extreme, low extreme, or nonextreme. In 
this way, the extreme weights can be controlled and the design effect due to unequal weighting 
reduced. 

Candidate predictor variables selected were those thought to be predictive of nonresponse 
and nonmissing for most of the sample (nonrespondents as well as respondents). The candidate 
predictor variables included 

• age (categorical); 

• typical age for a beginning student (yes or no); 

• race/ethnicity; 

• gender; 

• citizenship status in the base year; 

• attendance status in the base year; 

• level of institution attended in the base year; 

• control of institution attended in the base year; 

• region of institution attended in the base year; 

• size of institution attended in the base year (categorical); 

• applied for financial aid in the base year (yes or no); 



2 Folsom, R.E. and. Singh, A.C. (2000). “The Generalized Exponential for Sampling Weight Calibration 
for Extreme Values, Nonresponse, and Poststratification.” Proceedings of the Section on Survey Research Methods 
of the American Statistical Association, pp. 598-603. 
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• receipt of federal aid in the base year (yes or no); 

• receipt of Pell Grant in the base year (yes or no); 

• receipt of Stafford Loan in the base year (yes or no); 

• receipt of state aid in the base year (yes or no); 

• receipt of institutional aid in the base year (yes or no); 

• receipt of any aid in the base year (yes or no); 

• previous response status (whether the student was a respondent to both NPSAS:96 
and BPS:96/98 versus a nonrespondent to either NPSAS:96 or BPS:96/98); 

• income of independent students and parents of dependent students (collapsed); 

• parents' highest educational attainment; 

• degree completion status in 1998; 

• number of telephone numbers available; 

• number of times an answering machine was encountered (for located students); and 

• whether the student was in a field cluster. 

To detect important interactions for the logistic models, a Chi-squared automatic 
interaction detection analysis (CHAID) was performed on the predictor variables. The CHAID 
analysis divided the data into segments that differed with respect to the response variable 
(located, did not refuse, or respondent, depending on the model). The segmentation process first 
divided the sample into groups based on categories of the most significant predictor of response. 
It then split each of these groups into smaller subgroups based on other predictor variables. It 
also merged categories of a variable that were found to be nonsignificant. This splitting and 
merging process continued until no more statistically significant predictors were found (or until 
some other stopping rule was met). The interactions from the final CHAID segments were then 
defined. 

The nonresponse bias for these same variables was estimated, and then a statistical test of 
whether or not the bias was significant was performed. Tests were performed to identify 
significant differences between refusal conversions and other respondents; significant differences 
suggest a potential for nonresponse bias because of the refusal population being different from 
the other respondents. Additional tests were performed to detect significant differences between 
late respondents and other respondents; significant differences would suggest a potential for 
nonresponse bias because of the noncontacts/late-contact population being different from the 
other respondents. Results and further details of these analyses are given below in section C. 

The interaction segments and all the main effects were then subjected to variable 
screening in the GEM logistic procedure. Variables with significant bias were included in each 
nonresponse model. The models initially included all of the potentially important variables. The 
interaction segments identified by CHAID were also retained in all of the models. The most 
nonsignificant variables were deleted sequentially until the deletion of additional variables did 
not appreciably improve the unequal weighting effect (UWE). Different bounds on the weight 
adjustments, depending on whether the weight is classified as high extreme, nonextreme, or low 
extreme, were applied within the NPSAS:96 institutional sampling strata to accomplish 
nonresponse adjustment, truncation, and smoothing in one step. A large number of predictor 
variables in each nonresponse model were kept. This allows the estimates to be calibrated based 
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on the respondents to as many totals as possible that are known for both respondents and 
nonrespondents. 

a. Weight Adjustment for Nonrespondents Who Were Not Located 

Of the individuals eligible for the BPS : 1996/2001 sample, 92 percent was contacted. An 
adjustment was performed to the weight BO 11 00 to adjust for the remaining sample members 
who did not respond because they were not located. As described above, a CHAID analysis was 
performed on all of the predictor variables to detect important interactions. All potentially 
important variables were included in the model. Highly nonsignificant variables were deleted 
from the model until there was little change in the unequal weighting effect. 

Table 6.1 presents the final predictor variables used in the logistic model that adjusted 
the weights for those who were not located, and gives the weighted location rate and the average 
weight adjustment factors resulting from these variables. The weighting adjustment factor for 
student j was the reciprocal of the predicted response probability, or 



ADJ2j = 1/ pLj. 



The weight, adjusted for those who were not located, was then computed as 

LOCWT = BO 11 00 * AJD2 for those who were located 
= 0 otherwise. 

b. Weight Adjustment for Nonrespondents Who Refused 

Of the sample members who were eligible and located for the BPS: 1996/2001 sample, 3 
percent refused. An additional adjustment was performed to the weight that had been adjusted 
for the not located, LOCWT, to adjust for those who refused. As in the case of the adjustment 
for the not located, a CHAID analysis was performed on all of the predictor variables to detect 
important interactions. All potentially important variables were included in the model. Highly 
insignificant variables were deleted from the model until there was little change in the unequal 
weighting effect. 

Table 6.2 presents the final predictor variables used in the logistic model that adjusted 
the weights for those who refused and gives the weighted nonrefusal rate (for those who were 
located) and the average weight adjustment factors resulting from these variables. The weighting 
adjustment factor for student j was the reciprocal of the predicted response probability, or 



ADJ3j — 1/pNRefj. 



The weight adjusted for those who refused was computed 

NREFWT = LOCWT * ADJ3 for those who did not refuse 
= 0 otherwise. 
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Table 6.1. — Average weight adjustment factors from the logistic model used to adjust for 
student location nonresponse 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Total 


89.6 


1.45 


Age 




1.40 


19 or younger 


91.7 


20 to 23 


80.6 


1.77 


24 to 29 


92.5 


1.69 


30 to 39 


78.7 


1.49 


40 or older 


94.1 


1.74 


Gender 






Male 


87.3 


1.50 


Female 


91.5 


1.42 


Citizenship 




1.45 


U.S. citizen 


89.8 


Non-citizen 


86.2 


1.55 


Attendance status 




1.42 


Full-time/full year 1 institution 


92.7 


Full-time/full year more than 1 institution 


96.1 


1.05 


Full-time/part year 


79.8 


1.59 


Part-time/full year 1 institution 


88.6 


1.57 


Part-time/full year more than 1 institution 


60.4 


1.55 


Part-time/part year 


91.8 


1.61 


Institution level 






4-year 


91.7 


1.44 


2-year 


89.0 


1.45 


Less-than-2-year 


83.3 


1.57 


Institution control 






Public 


90.2 


1.41 


Private not-for-profit 


91.0 


1.49 


Private for-profit 


83.5 


1.55 


Institution region 




1.65 


New England 


90.1 


Mid East 


91.4 


1.30 


Great Lakes 


91.7 


1.37 


Plains 


93.5 


1.57 


Southeast 


86.9 


1.46 


Southwest 


88.2 


1.56 


Rocky Mountains 


84.4 


1.67 


Far West 


89.8 


1.45 


Outlying Area 


92.3 


1.19 


Type of institution and enrollment category 
Public 






Fewer than 1,000 


95.5 


1.09 


1,000-2,499 


81.1 


1.79 


2,500-4,999 


91.6 


1.42 


5,000-9,999 


81.6 


1.54 


10,000-19,999 


92.6 


1.43 


20,000 or more 


94.4 


1.32 


Private not-for-profit 




1.32 


Fewer than 1,000 


84.5 


1,000-2,499 


93.7 


1.54 


2,500-4,999 


93.6 


1.42 


5,000-9,999 


84.6 


1.45 


10,000 or more 


92.3 


1.58 


Private for-profit 




1.63 


Fewer than 300 


82.0 


300-999 


86.6 


1.49 


1 ,000 or more 


79.7 


1.59 



See footnotes at end of table. 
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Table 6.1. — Average weight adjustment factors from the logistic model used to adjust for 
student location nonresponse — Continued 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Applied for aid 






Yes 


88.6 


1.42 


No 


91.9 


1.62 


Receipt of federal aid 






Yes 


88.8 


1.43 


No 


90.3 


1.49 


Receipt of Pell Grant 






Yes 


86.9 


1.47 


No 


90.7 


1.44 


Receipt of Stafford Loan 






Yes 


88.8 


1.45 


No 


89.9 


1.46 


Receipt of state aid 






Yes 


93.4 


1.32 


No 


88.9 


1.49 


Receipt of institution aid 






Yes 


92.4 


1.42 


No 


88.8 


1.47 


Receipt of any aid 






Yes 


89.2 


1.42 


No 


90.1 


1.54 


Parents’ income (for dependent students) 






$29,999 or less 


88.7 


1.45 


$30,000-$59,999 


94.1 


1.35 


$60,000-$99,999 


92.8 


1.41 


$100,000 or more 


85.8 


1.56 


Student’s income (for independent students) 






$5,000 or less 


85.2 


1.55 


$5,000-$9,999 


82.6 


1.66 


$10,000-$ 19,999 


75.0 


1.80 


$20,000 or more 


94.6 


1.53 


Parents’ highest education 






Less than high school 


86.1 


1.44 


High school diploma 


88.1 


1.41 


Some college 


90.9 


1.47 


Degree status in 1998 




1.44 


Certificate or associate degree received 


90.5 


Vocational or associate degree program, degree not received 


87.2 


1.49 


Bachelor’s degree or program 


92.3 


1.44 


Telephone numbers available 




1.71 


0 or 1 number 


88.2 


2 numbers 


88.2 


1.59 


3 numbers 


91.4 


1.37 


4 numbers 


90.3 


1.48 


5 numbers 


93.7 


1.39 


6 numbers 


83.8 


1.46 


7 or more numbers 


83.2 


1.40 


Number of times answering machine was encountered 






None 


91.4 


1.38 


Once 


92.8 


1.36 


More than once 


88.1 


1.50 



See footnotes at end of table. 
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Table 6.1. — Average weight adjustment factors from the logistic model used to adjust for 
student location nonresponse — Continued 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Interaction segments 




1.41 


1 =Not in field cluster, White, non-Hispanic, certificate, associate’s, 


91.2 


or bachelor’s received, or in bachelor’s program 






2 = Not in field cluster, White, non-Hispanic, vocational or associate’s 


82.0 


1.66 


degree program 




1.71 


3 = Not in field cluster, other than White non-Hispanic, did not 


70.5 


encounter answering machine 






4 = Not in field cluster, other than White non-Hispanic, encountered 


84.8 


1.72 


answering machine 1 or more times 






5 - In field cluster, White, non-Hispanic, attended full-time/full year 


96.0 


1.42 


6 = In field cluster, White, non-Hispanic, attended other than 


93.9 


1.46 


full-time/full year 






7 - In field cluster, Black, non-Hispanic, male 


84.2 


1.46 


8 - In field cluster, Black, non-Hispanic, female 


93.9 


1.42 


9 - In field cluster, Hispanic, 0 or 1 telephone numbers available 


78.2 


1.40 


10 - In field cluster, Hispanic, 2-4 telephone numbers available 


94.4 


1.18 


1 1 = In field cluster, Hispanic, 5 or more telephone numbers available 


90.1 


1.52 


12 = In field cluster, other race, not typical age for level 


93.9 


1.76 


13 - In field cluster, other race, typical age for level 


95.1 


1.32 



NOTE: Predictor variables are from base year data (NPSAS:96) with the exception of degree status in 1998, telephone numbers 
available, number of times answering machine was encountered, and certain interaction variables. 

The weight used is BO 1 _1 00. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students Longitudinal 
Study: 1996/2001 (BPS: 1996/2001). 
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Table 6.2. — Average weight adjustment factors from the logistic model used to adjust for 
student refusal nonresponse 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Total 


96.1 


1.04 


Age 






1 9 or younger 


96.6 


1.03 


20 to 23 


95,2 


1.05 


24 to 29 


94.5 


1.06 


30 to 39 


94.9 


1.06 


40 or older 


97.0 


1.03 


Race/ethnicity 




1.04 


White, non-Hispanic 


95.8 


Black, non-Hispanic 


97.4 


1.03 


Hispanic 


96.9 


1.03 


Asian/Pacific Islander 


96.5 


1.03 


American Indian/Alaska Native 


97.8 


1.02 


Other 


93.5 


1.08 


Gender 






Male 


96.8 


1.03 


Female 


95.6 


1.04 


Citizenship 




1.04 


U.S. citizen 


96.1 


Non-citizen, eligible 


96.7 ‘ 


1.03 


Non-citizen, ineligible 


98.4 


1.00 


Attendance status 






Full-time/full year 1 institution 


96.2 


1.03 


Full-time/full year more than 1 institution 


97.0 


1.03 


Full-time/part year 


95.8 


1.05 


Part-time/full year 1 institution 


97.0 


1.03 


Part-time/full year more than 1 institution 


96.7 


1.01 


Part-time/part year 


95.3 


1.05 


Institution level 






4-year 


97.0 


1.03 


2-year 


95.8 


1.05 


Less-than-2-year 


94.2 


1.06 


Institution control 






Public 


96.4 


1.03 


Private not-for-profit 


96.9 


1.03 


Private for-profit 


93.6 


1.06 


Institution region 






New England 


93.8 


1.06 


Mid East 


94.4 


1.05 


Great Lakes 


96.1 


1.03 


Plains 


95.1 


1.05 


Southeast 


97.2 


1.03 


Southwest 


96.2 


1.03 


Rocky Mountains 


98.6 


1.02 


Far West 


96.5 


1.03 


Outlying Area 


97.9 


1.02 


Type of institution and enrollment category 
Public 






Fewer than 1,000 


95.6 


1.06 


1,000-2,499 


96.2 


1.05 


2,500-4,999 


99.5 


1.00 


5,000-9,999 


95.5 


1.05 


10,000-19,999 


96.0 


1.03 


20,000 or more 


1 96.3 


1.03 



See footnotes at end of table. 
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Table 6.2. — Average weight adjustment factors from the logistic model used to adjust for 
student refusal nonresponse — Continued 


Predictor variables 


Weighted response rate 


Average weight adjustment 


Private not-for-profit 




1.05 


Fewer than 1,000 


96.0 


1,000-2,499 


97.3 


1.03 


2,500-4,999 


95.9 


1.04 


5,000-9,999 


97.7 


1.02 


10,000 or more 


97.1 


1.03 


Private for-profit 




1.06 


Fewer than 300 


94.6 


300-999 


93.4 


1.07 


1,000 or more 


92.3 


1.07 


Applied for aid 




1.04 


Yes 


96.2 


No 


96.1 


1.03 


Receipt of federal aid 




1.04 


Yes 


95.9 


No 


96.3 


1.03 


Receipt of Pell Grant 




1.04 


Yes 


96.0 


. No 


96.2 


1.03 


Receipt of Stafford Loan 




1.04 


Yes 


95.6 


No 


96.4 


1.03 


Receipt of state aid 




1.03 


Yes 


96.8 


No 


96.0 


1.04 


Receipt of institution aid 




1.03 


Yes 


97.2 


No 


95.9 


1.04 


Receipt of any aid 




1.04 


Yes 


96.3 


No 


95.9 


1.04 


Parents’ income (for dependent students) 


95.8 


1.04 


$29,999 or less 


$30, 000-$5 9,999 


96.6 


1.03 


$ 60 ,000-$9 9,999 


96.0 


1.04 


$100,000 or more 


97.6 


1.02 


Student’s income (for independent students) 


96.6 


1.04 


$5,000 or less 


$5,000-$9,999 


93.3 


1.07 


$10,000-$ 19,9 99 


96.1 


1.05 


$20,000 or more 


95.9 


1.04 


Parents’ highest education 




1.03 


Less than high school 


96.9 


High school diploma 


95.7 


1.04 


Some college 


96.3 


1.03 


Degree status in 1998 


93.1 


1.07 


Certificate or associate degree received 


Vocational or associate degree program, degree not received 


96.4 


1.03 


Bachelor’s degree or program 


97.0 


1.03 


Telephone numbers available 




1.07 


0 or 1 number 


93.6 


2 numbers 


94.9 


1.06 


3 numbers 


96.4 


1.04 


4 numbers 


96.2 


1.04 


5 numbers 


97.6 


1.02 


6 numbers 


95.7 


1.04 


7 or more numbers 


98.1 


1.02 



See footnotes at end of table. 
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Table 6.2. — Average weight adjustment factors from the logistic model used to adjust for 
student refusal nonresponse — Continued 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Number of times answering machine was encountered 






None 


96.6 


1.03 


Once 


97.8 


1.02 


More than once 


95.6 


1.04 


In field cluster 






Yes 


95.8 


1.04 


No 


96.5 


1.03 


Interaction segment 




1.20 


1 = Not prior respondent, not typical age for level 


85.4 


2 - Not prior respondent, typical age for level 


92.3 


1.09 


3 - Prior respondent, 0 or 1 telephone numbers available 


94.1 


1.07 


4 = Prior respondent, 2 telephone numbers available, encountered answering 
machine none or once 


98.1 


1.03 


5 = Prior respondent, 2 telephone numbers available, encountered answering 


91.4 ' 


1.10 


machine more than once 






6 = Prior respondent, 3 telephone numbers available, encountered answering 


97.7 


1.02 


machine none or once 






7 = Prior respondent, 3 telephone numbers available, encountered answering 


96.0 


1.05 


machine more than once 






8 = Prior respondent, 4 or more telephone numbers available, 4-year institution 


97.9 


1.02 


9 = Prior respondent, 4 or more telephone numbers available, 2-year or less-than- 


96.7 


1.04 


2-year institution 







NOTE: Predictor variables are from base year data (NPSAS.96) with the exception of degree status in 1998, telephone numbers 
available, number of times answering machine was encountered, in field cluster, and certain interaction variables. 

The weight used is LOCWT. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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c. Weight Adjustments for Located Nonrespondents Who Were Not 
Refusals 

Of the 9,259 who were eligible, located, and did not refuse, 9,132 responded to the 
BPS: 1996/2001 survey and the remaining 127 did not respond for reasons other than refusal. 
Next, an adjustment was made to NREFWT to adjust for these 127. As in the case of the other 
adjustments, a CHAID analysis was performed on all of the predictor variables to detect 
important interactions. All potentially important variables were included in the model. 

Highly insignificant variables were deleted from the model until there was little change in the 
unequal weighting effect. 

Table 6.3 presents the final predictor variables used in the logistic model that adjusted 
the weights for those who were interviewed, and gives the weighted interview rate (for those 
who were located and did not refuse) and the average weight adjustment factors resulting from 
these variables. The weighting adjustment factor for student j was the reciprocal of the predicted 
response probability, or 



ADJ4j = 1/pRj 

and the weight was computed as: 

BO 1 AWT = NREFWT * ADJ4 for the 9,132 who responded, and 
= 0 otherwise. 

This final weight was rounded to the nearest integer and is denoted by BO 1 AWT. This weight is 
to be used for analyzing the data collected from the 9,132 responses to BPS: 1996/2001. 

3. Longitudinal Analysis Weights 

Two longitudinal weights were constructed: 

• one weight (B01LWT1) was computed for the 8,934 eligible NPSAS:96 sample 
members who responded to all three rounds of the survey (i.e., responded to 
NPSAS:96, BPS:96/98, and BPS: 1996/2001); and 

• the second weight (B0 1 LWT2) was computed for the 8,999 eligible NPSAS:96 
sample members who responded to both BPS: 1996/2001 and NPSAS:96. 

These two weights were each constructed by applying additional nonresponse adjustments to the 
final BPS: 1996/2001 cross-sectional weight (i.e., B01AWT). 
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Table 6.3. — Average weight adjustment factors from the logistic model used to adjust for 
nonresponse other than refusal 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Total 


98.8 


1.01 


Age 






1 9 or younger 


98.8 


1.01 


20 to 23 


99.5 


1.01 


24 to 29 


98.3 


1.02 


30 to 39 


97.8 


1.03 


40 or older 


99.6 


1.01 


Race/ethnicity 




1.01 


White, non-Hispanic 


99.0 


Black, non-Hispanic 


98.8 


1.01 


Hispanic 


98.1 


1.02 


Asian/Pacific Islander 


97.9 


1.02 


American Indian/Alaska Native 


98.4 


1.01 


Other 


98.7 


1.02 


Gender 






Male 


98.5 


1.02 


Female 


99.1 


1.01 


Attendance status 






Full-time/full year 1 institution 


99.0 


1.01 


Full-time/full year more than 1 institution 


99.0 


1.01 


Full-time/part year 


98.3 


1.02 


Part-time/full year 1 institution 


98.6 


1.02 


Part-time/full year more than 1 institution 


100.0 


1.00 


Part-time/part year 


99.0 


1.02 


Institution level 






4-year 


98.6 


1.01 


■ 2-year 


99.2 


1.01 


Less-than-2-year 


97.6 


1.02 


Institution control 






Public 


99.1 


1.01 


Private not-for-profit 


98.3 


1.02 


Private for-profit 


97.8 


1.02 


Institution region 






New England 


96.7 


1.04 


Mid East 


98.5 


1.02 


Great Lakes 


99.3 


1.01 


Plains 


99.3 


1.01 


Southeast 


99.3 


1.01 


Southwest 


99.3 


1.01 


Rocky Mountains 


99.5 


1.01 


Far West 


98.0 


1.02 


Outlying Area 


99.2 


1.01 


Type of institution and enrollment category 
Public 






Fewer than 1,000 


98.9 


1.01 


1,000-2,499 


99.7 


1.01 


2,500-4,999 


99.8 


1.00 


5,000-9,999 


99.1 


1.01 


10,000-19,999 


98.6 


1.02 


20,000 or more 


99.2 


1.01 


Private not-for-profit 




1.00 


Fewer than 1,000 


99.6 


1,000-2,499 


98.0 


1.02 


2,500-4,999 


99.3 


1.01 


5,000-9,999 


97.5 


1.03 


10,000 or more 


98.1 


1.02 



See footnotes at end of table. 
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Table 6.3. — Average weight adjustment factors from the logistic model used to adjust for 
nonresponse other than refusal — Continued 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Private for-profit 






Fewer than 300 


98.9 


1.01 


300-999 


98.0 


1.02 


1 ,000 or more 


95,7 


1.04 


Applied for aid 






Yes 


98.8 


1.01 


No 


98.8 


1.02 


Receipt of federal aid 






Yes 


98.9 


1.01 


No 


98.7 


1.02 


Receipt of Pell Grant 






Yes 


99.0 


1.01 


No 


98.8 


1.01 


Receipt of Stafford Loan 






Yes 


98.7 


1.01 


No 


98.9 


1.02 


Receipt of state aid 






Yes 


99.2 


1.01 


No 


98.7 


1.02 


Receipt of institution aid 






Yes 


98.6 


1.02 


No 


98.9 


1.01 


Receipt of any aid 




1.01 


Yes 


98.9 


No 


98.7 


1.02 


Prior respondent 




1.01 


Yes 


98.8 


No 


99.7 


1.00 


Parents 1 income (for dependent students) 




1.02 


$29,999 or less 


98.5 


$30,000-559,999 


98.9 


1.01 


$60,000-599,999 


99.1 


1.01 


$100,000 or more 


98.6 


1.02 


Students income (for independent students) 




1.02 


$5,000 or less 


98.5 


$5,000-59,999 


98.7 


1.02 


$10,000-519,999 


99.3 


1.01 


$20,000 or more 


98.9 


1.02 


Parents 1 highest education 




1.03 


Less than high school 


97.7 


High school diploma 


99.1 


1.01 


Some college 


98.8 


1.01 


Degree status in 1998 




1.01 


Certificate or associate degree received 


99.2 


Vocational or associate degree program, degree not received 


98.9 


1.01 


Bachelor’s degree or program 


98.5 


1.02 


Telephone numbers available 




1.02 


0 or 1 number 


98.7 


2 numbers 


98.7 


1.02 


3 numbers 


99.4 


1,01 


4 numbers 


98.0 


1.02 


5 numbers 


99.1 


1.01 


6 numbers 


99.0 


1.01 


7 or more numbers 


99.1 


1,01 


Number of times answering machine was encountered 






None 


98.7 


1.02 


Once 


99.5 


1.01 


More than once 


98.7 


1.02 



See footnotes at end of table. 
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Table 6.3. — Average weight adjustment factors from the logistic model used to adjust for 
nonresponse other than refusal — Continued 



Predictor variables 


Weighted response rate 


Average weight adjustment 


In field cluster 






Yes 


98.4 


1.02 


No 


99.2 


1.01 


Interaction segment 




1.01 


1 = U.S. citizen, male 


98.8 


2 - U.S. citizen, female 


99.1 


1.01 


3 - Non-citizen, public institution 


97.9 


1.03 


4 = Non-citizen, private institution, male 


86.4 


1.16 


5 = Non-citizen, private institution, female 


97.6 


1.03 



NOTE: Predictor variables are from base year data (NPSAS:96) with the exception of prior respondent, degree status in 1998, 
telephone numbers available, number of times answering machine was encountered, and in field cluster. 

The weight used is NREFWT. 

SOURCE: U.S. Department of Education, national Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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The weight for analyzing respondents to all three surveys, NPSAS:96, BPS:96/98, and 
BPS: 1996/2001, was constructed by applying an additional nonresponse adjustment to the final 
unrounded BPS:1996/2001 cross-sectional weight (unrounded version of B01AWT). As for the 
other models, CHAID was used to determine the interaction segments, then the GEM modeling 
procedure was used to determine the adjustment factor. Table 6,4 presents the final predictor 
variables used in the logistic model that adjusted the weights for those who were not also 
interviewed in both NPSAS:96 and BPS:96/98 and gives the weighted interview rate (for those 
who were interviewed in BPS.T996/2001) and the average weight adjustment factors resulting 
from these variables. The final weight was rounded to integer values, and is denoted as 
- B01LWT1. 

Specifically, 

B01LWT1 = BO 1 AWT * ADJ5 for those who responded to all three surveys, and 
= 0 otherwise, 

where 

ADJ5 = l/p96-98-01j 

is the reciprocal of the predicted response probability. 

The weight for analyzing respondents to both NPSAS:96 and BPS: 1996/2001 was also 
constructed by applying an additional nonresponse adjustment to the final BPS.T 996/2001 cross- 
sectional weight, following the same steps as for the other adjustments. Table 6.5 presents the 
final predictor variables used in the model and the weighted response rates and adjustment 
factors. The final weight was rounded to integer values and is denoted as B01LWT2. 

Specifically, 

B01LWT2 = B0 1 AWT * ADJ6 for those who responded to both NPSAS:96 
and BPS: 1996/2001, and 
= 0 otherwise, 

where 

ADJ6 = l/p96-oij 

is the reciprocal of the predicted response probability. 

The distributions of the weight adjustment factors for the BPS: 1996/2001 analysis 
weights and the two longitudinal weights are presented in table 6.6. Table 6.7 presents the 
distributions of the initial, intermediate, and final weights along with their unequal weighting 
design effects. 
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Table 6.4. — Average weight adjustment factors from the logistic model used to adjust for 
nonresponse to either NPSAS:96 or BPS:96/98, among the respondents to 
BPS:1996/2001 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Total 


93.6 


1.05 


Age 






19 or younger 


93.7 


1.05 


20 to 23 


94.9 


1.05 


24 to 29 


91.7 


1.10 


30 to 39 


96.5 


1.03 


40 or older 


89.9 


1.09 


Race/ethnicity 


93.4 


1.05 


White, non-Hispanic 


Black, non-Hispanic 


94.0 


1.06 


Hispanic 


94.5 


1.05 


Asian/Pacific Islander 


93.7 


1.05 


American Indian/Alaska Native 


93.9 


1.05 


Other 


94.7 


1.05 


Gender 






Male 


93.5 


1.05 


Female 


93.7 


1.05 


Citizenship 


93.7 


1.05 


U.S. citizen 


Non-citizen, eligible 


94.7 


1.04 


Non-citizen, ineligible 


81.3 


1.22 


Attendance status 




1.05 


Full-time/full year 1 institution 


94.4 


Full-time/full year more than 1 institution 


100.0 


1.00 


Full-time/part year 


95.4 


1.05 


Part-time/full year 1 institution 


91.5 


1.08 


Part-time/full year more than 1 institution 


100.0 


1.00 


Part-time/part year 


90.4 


1.10 


Institution level 






4-year 


95.4 


1.05 


2-year 


92.2 


1.08 


Less-than-2-year 


93.7 


1.06 


Institution control 






Public 


93.0 


1.06 


Private not-for-profit 


97.6 


1.02 


Private for-profit 


92.2 


1.09 


Institution region 




1.06 


New England 


93.6 


Mid East 


96.3 


1.03 


Great Lakes 


93.6 


1.05 


Plains 


91.9 


1.07 


Southeast 


94.9 


1.05 


Southwest 


90.6 


1.09 


Rocky Mountains 


93.4 


1.06 


Far West 


92.3 


1.07 


Outlying Area 


97.9 


1.02 


Type of institution and enrollment category 
Public 






Fewer than 1,000 


100.0 


1.00 


1,000-2,499 


94.7 


1.05 


2,500-4,999 


93.6 


1.05 


5,000-9,999 


95.2 


1.04 


10,000-19,999 


90.6 


1.09 


20,000 or more 


92.9 


1.07 



See footnotes at end of table. 
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Table 6.4. — Average weight adjustment factors from the logistic model used to adjust for 
nonresponse to either NPSAS:96 or BPS:96/98, among the respondents to 
BPS:1996/2001 — Continued 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Private not-for-profit 






Fewer than 1,000 


98.5 


1.02 


1,000-2,499 


97.2 


1.03 


2,500-4,999 


98.1 


1.02 


5,000-9,999 


98.4 


1.01 


10,000 or more 


97.0 


1.03 


Private for-profit 






Fewer than 300 


93.1 


1.07 


300-999 


90.5 


1.11 


1,000 or more 


93.9 


1.06 


Applied for aid 






Yes 


94.7 


1.05 


No 


91.2 


1.08 


Receipt of federal aid 






Yes 


94.2 


1.05 


No 


93.2 


1.06 


Receipt of Pell Grant 






Yes 


93.7 


1.06 


No 


93.6 


1.05 


Receipt of Stafford Loan 






Yes 


93.9 


1.05 


No 


93.5 


1.05 


Receipt of state aid 






Yes 


95.5 


1.04 


No 


93.3 


1.06 


Receipt of institution aid 






Yes 


95.3 


1.04 


No 


93.2 


1.06 


Receipt of any aid 






Yes 


94.6 


1.05 


No 


92.3 


1.07 


Parents’ income (for dependent students) 






$29,999 or less 


93.4 


1.06 


$30,000-$59,999 


94.2 


1.05 


$60,000-$99,999 


93.1 


1.06 


$1 00,000 or more 


94.2 


1.05 


Student’s income (for independent students) 






$5,000 or less 


93.6 


1.06 


$5,000-$9,999 


93.4 


1.07 


$10,000-$ 19,999 


97.5 


1.02 


$20,000 or more 


90.8 


1.09 


Parents’ highest education 






Less than high school 


93.0 


1.06 


High school diploma 


94.1 


1.05 


Some college 


93.4 


1.06 


Degree status in 1998 






Certificate or associate degree received 


93.2 


1.07 


Vocational or associate degree program, degree not received 


92.5 


1.07 


Bachelor’s degree or program 


95.3 


1.05 


Interaction segment 






1 = Did not apply for aid, public 


94.4 


1.06 


2 = Did not apply for aid, private not-for-profit 


97.8 


1.02 


3 — Did not apply for aid, private for-profit 


91.6 


1.09 


4 = Applied for aid, public 


90.6 


1.09 


5 = Applied for aid, private not-for-profit 


96.3 


1.04 


6 — Applied for aid, private for-profit 


97.2 


1.04 



NOTE: Predictor variables are from base year data (NPSAS:96) with exception of degree status in 1998. Weight used is BO 1 AWT. 



SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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Table 6.5. — Average weight adjustment factors from the logistic model used to adjust for 
nonresponse to NPSAS:96, among the respondents to BPS:1996/2001 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Total 


96.3 


1.04 


Age 






1 9 or younger 


96.1 


1.04 


20 to 23 


96.7 


1.03 


24 to 29 


95.2 


1.05 


30 to 39 


97.6 


1.02 


40 or older 


97.9 


1.02 


Race/ethnicity 






White, non-Hispanic 


96.3 


1.04 


Black, non-Hispanic 


96.4 


1.04 


Hispanic 


96.0 


1.04 


Asian/Pacific Islander 


97.2 


1.03 


American Indian/Alaska Native 


93.9 


1.05 


Other 


94.7 


1.04 


Gender 






Male 


95.9 


1.04 


Female 


96.7 


1.03 


Citizenship 






U.S. citizen 


96.3 


1.04 


Non-citizen, eligible 


95.8 


1.05 


Non-citizen, ineligible 


100.0 


1.00 


Attendance status 






Full-time/full year 1 institution 


96.7 


1.04 


Full-time/full year more than 1 institution 


100.0 


1.00 


Full-time/part year 


97.0 


1.03 


Part-time/full year 1 institution 


93.9 


1.07 


Part-time/full year more than 1 institution 


100.0 


1.00 


Part-time/part year 


96.0 


1.04 


Institution level 






4-year 


96.3 


1.04 


2-year 


96.3 


1.03 


Less-than-2-year 


96.4 


1.03 


Institution control 






Public 


95.8 


1.05 


Private not-for-profit 


98.4 


1.02- 


Private for-profit 


96.5 


1.03 


Institution region 






New England 


97.4 


1.03 


Mid East 


98.3 


1.02 


Great Lakes 


96.2 


1.04 


Plains 


95.7 


1.04 


Southeast 


97.5 


1.03 


Southwest 


93.2 


1.07 


Rocky Mountains 


96.4 


1.04 


Far West 


94.9 


1.05 


Outlying Area 


97.9 


1.02 


Type of institution and enrollment category 






Public 






Fewer than 1,000 


100.0 


1.00 


1,000-2,499 


97.3 


1.03 


2,500-4,999 


96.6 


1.03 


5,000-9,999 


97.0 


1.03 


10,000-19,999 


94.8 


1.06 


20,000 or more 


95.3 


1.05 


Private not-for-profit 






Fewer than 1,000 


99.0 


1.01 


1,000-2,499 


98.4 


1.02 


2,500-4,999 


98.6 


1.01 


5,000-9,999 


98.7 


1.01 


10,000 or more 


97.7 


1.02 


Private for-profit 






Fewer than 300 


95.3 


1.04 


300-999 


97.0 


1.03 


1 ,000 or more 


97.5 


1.03 



See footnotes at end of table. 
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Table 6.5. — Average weight adjustment factors from the logistic model used to adjust for 
nonresponse to NPSAS:96, among the respondents to BPS:1996/2001— 
Continued 



Predictor variables 


Weighted response rate 


Average weight adjustment 


Receipt of federal aid 




1.04 


Yes 


96.3 


No 


96.3 


1.03 


Receipt of Pell Grant 




1.04 


Yes 


95.9 


No 


96.5 


1.03 


Receipt of Stafford Loan 






Yes 


96.2 


1.04 


No 


96.3 


1,04 


Receipt of state aid 






Yes 


96.9 


1.03 


No 


96.2 


1.04 


Receipt of institution aid 






Yes 


96.3 


1.04 


No 


96.3 


1.04 


Receipt of any aid* 




1.04 


Yes 


96.5 


No 


96.0 


1.04 


Parents’ income (for dependent students) 




1.05 


$29,999 or less 


95.1 


$30,000-$59,999 


96.5 


1.03 


$60,000-599,999 


95.8 


1.04 


$100,000 or more 


97.9 


1.02 


Student’s income (for independent students) 




1.03 


$5,000 or less 


97.1 


$5,000-$9,999 


95.9 


1.04 


$10,000-$ 19,999 


98.2 


1.02 


$20,000 or more 


96.4 


1.03 


Parents’ highest education 




1.07 


Less than high school 


94.2 


High school diploma 


96.9 


1.03 


Some college 


96.2 


1.04 


Degree status in 1998 




1.03 


Certificate or associate degree received 


97.3 


Vocational or associate degree program, degree not received 


96.0 


1.04 


Bachelor’s degree or program 


96.3 


1.04 


Interaction segment 




1.05 


1 = Did not apply for aid 


95.0 


2 = Applied for aid, NPSAS:96 school greater than 13,184 enrolled 


95.2 


1.05 


3 = Applied for aid, NPSAS.96 school 13,184 or fewer enrolled, 


93.0 


1.07 


parents’ highest education less than high school 




1.02 


4 = Applied for aid, NPSAS:96 school 13,184 or fewer enrolled. 


98.2 


parents’ highest education high school or more 







NOTE: Predictor variables are from base year data (NPSAS:96) with the exception of degree status in 1998. 



The weight used is BO 1 AWT. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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Table 6.6.— Distribution of values for the BPS:1996/2001 weight adjustment factors 



Quantile 


Location 

ADJ2 


Refusal 

ADJ3 


Nonresponse 

ADJ4 


Longitudinal 

ADJ5 


Longitudinal 

AJD6 


Minimum 


0.05 


0.56 


0.91 


0.89 


0.87 


1% 


0.26 


1.00 


1.00 


1.00 


1.00 


5% 


0.94 


1.00 


1.00 


1.00 


1.00 


10% 


1.04 


1.00 


1.00 


1.01 


1.00 


25% 


1.18 


1.01 


1.00 


1.02 


1.01 


Median 


1.38 


1.02 


1.01 


1.04 


1.02 


75% 


1.68 


1.05 


1.02 


1.07 


1.05 


90% 


2.05 


1.09 


1.04 


1.12 


1.09 


95% 


2.29 


1.12 


1.06 


1.15 


1.11 


99% 


2.65 


1.19 


1.15 


1.24 


1.17 


Maximum 


3.49 


1.30 


1.20 


1.42 


1.35 



NOTE: Some adjustment factors are less than one because the GEM weight adjustment procedure adjusts for nonresponse, and 
truncates and smooths the weights in one step. 

SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 



Table 6.7. — Distribution of initial, intermediate, and final weights for BPS:1996/2001 





Initial weight 


Intermediate weights 


Cross-sectional 
analysis weight 


Longitudinal a 


nalysis weights 


Quantile 


B01 100 


Location 

LOCWT 


Refusal 

NREFWT 


B01AWT 


B01LWT1 


B01LWT2 


Minimum 


7 


8 


8 


8 


8 


8 


1% 


22 


27 


27 


28 


28 


28 


5% 


38 


50 


51 


52 


54 


53 


10% 


47 


67 


68 


69 


72 


71 


25% 


76 


113 


116 


118 


120 


120 


Median 


144 


196 


202 


205 


211 


209 


75% 


225 


324 


334 


340 


347 


344 


90% 


680 


782 


795 


800 


824 


823 


95% 


1,151 


1,278 


1,333 


1,354 


1,392 


1,384 


99% 


2,964 


3,125 


3,185 


3,197 


2,987 


3,140 


Maximum 


12,897 


5,588 


4,453 


4,340 


4,357 


4,371 


Unequal weighting 












3.026 


design effect 


7.245 


3.020 


3.059 


3.047 


3.024 



SOURCE: U.S. Department of Education, National Center for Education Statistics, Beginning Postsecondary Students 
Longitudinal Study: 1996/2001 (BPS: 1996/2001). 
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